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MUTANT PROTEOLYTIC ENZYMES FROM BACILLUS 

BACKGROUND OF THE INVENTION 

1 . Field of t he Invention 

This invention relates to mutant proteolytic enzymes 
having improved properties relative to the wild-type 
enzyme, to genetic constructs which code for the mutant 
5 proteolytic enzymes, to methods of predicting mutations 
which enhance the stability of the enzyme, and to methods 
of producing the mutant proteolytic enzymes. 

2. Description of the Related Art 

Subtilisins are a family of extracellular proteins 
10 having molecular weights in the range of 25,000-35,000 
daltons and are produced by various Bacillus species. These 
proteins function as peptide hydrolases in that they 
catalyze the hydrolysis of peptide linkages in protein 
substrates at neutral and alkaline pH values. Subtilisins 
15 are termed serine proteases because they contain a specific 
serine residue which participates in the catalytic 
hydrolysis of peptide substrates. A subtil isin enzyme 
isolated from soil samples and produced by Bacillus lentus 
for use in detergent formulations having increased protease 
20 and oxidative stability over commercially available enzymes 
under conditions of pH 7 to 10 and at temperature of 10 to 
60 °C in aqueous solutions has been disclosed in copending 
patent application serial number 07/398,854, filed on 
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8/25/89. Thi. B. lentus alkaline protease enzyme ' (BLAP, 
vide infra) is obtained in commercial quantities by 
cultivating a Bacillus licheniformis ATCC "926 strain 
which had been transformed by an expression Pj^*J** 
contained the wild type BLAP gene and the B. llchenitormis > 
ATCC 53926 alkaline protease gene promoter. 

industrial processes generally are performed under 
physical conditions which require highly stable enzymes 
Enzymes may be inactivated by high temperatures, pH 
"trTes, oxidation, an* surfactants. Even though Bacillus 
subtilisin proteases are currently used in many 
applications, including detergent formulations, stability 
improvements are still needed. Market trends are toward 
ffi ore concentrated detergent powders, and an increase in 
Uquid formulations. Increased shelf stability and 
oxidative stability, with retention of catalytic efficiency 
are needed. It is therefore desirable to isolate novel 
enzymes with increased stability, or to improve the 
stability of existing enzymes, including subtilisin 



10 



15 



20 proteases such as BIAP. 

The stability of a protein is a function of its three 
dimensional structure. A protein folds into a three 
dimensional conformation based upon the primary amino acid 
sequence, and upon its surrounding environment. The 

25 function and stability of a protein are a direct result of 
its three dimensional structure. 

A large body of information has been published which 
describes changes in enzyme properties as a result of 
alterations in the primary amino acid sequence of the 

30 enzyme. These alterations can result from random or site 
specific alterations of the gene which expresses the enzyme 
using genetic engineering techniques. Random approaches 
*utagenize total cellular DNA, followed by selection for 
the synthesis of an enzyme with improved properties. This 

35 approach requires neither knowledge of the three 
dimensional structure of the enzyme, nor any P^diotive 
capability on the part of the researcher. Site directed 
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mutagenesis, on the other hand, requires a rational 
approach for the introduction of amino acid changes. In 
this approach one or more amino acids may be replaced by 
other residues by altering the DNA sequence which encodes 
5 the protein. This can, be accomplished using 

oligonucleotide directed in vitro mutagenesis. The 
following references teach site-directed mutagenesis 
procedures used to generate specific amino acid 
substitution (s): Nines, J.C., and Ray, D.S. (1980) Gene 
10 11:207-218; Zoller, M.J., and Smith, M. (1982) Nucleic 
Acids Res. 10:6487-6500; Norrander, J., et al. (1983) Gene 
26:101-106; Morinaga, Y. , et al. (1984) Bio/Technology 
2:636-639; Kramer, W., et al. (1984) Nucleic Acids Res. 
12:9441-9456; Carter, P., et al. (1985) Nucleic Acids Res. 
15 13:4431-4443; Kunkel, T.A. (1985) Proc. Natl. Acad. Sci. 
USA 82:488-492; Bryan, P., et al. (1986) Proc. Natl. Acad. 
Sci. USA 83:3743-3745. 

A rational approach may or may not require knowledge of 
a protein's structure. For example , patent application W0 
20 89/06279 describes the comparison of the primary amino acid 
sequence of different subtilisins while contrasting 
differences in physical and chemical properties. The 
primary amino acid sequences of the different subtilisins 
are aligned for the greatest homology, while taking into 
25 account amino acid insertions, deletions, and total number 
of amino acids. 

Currently, the amino acid sequences of at least 10 
subtilisin proteases have been published. Eight of these 
subtilisins vere isolated from species of Bacilli, and 
30 include subtilisin 168 (Stahl, M.L., and Ferrari, E. (1984) 
J. Bacterid. 158:411-418), subtilisin BPN' (Vasantha, N., 
et al., (1984) J. Bacterid. 159:811-819), subtilisin 
Carlsberg (Jacobs, M. , et al. (1985) Nucleic Acids Res. 
13:8913-8926), subtilisin DY (Nedkov, P., et al. (1985) 
Biol. Chem. Hoppe-Seyler 366:421-430), subtilisin 
amylosacchariticus (Kurihara, M. , et al. (1972) J.Biol. 
Chem. 247:5619-5631), subtilisin mesenticopeptidase 



35 
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(Svendsen, X... et al. (1986) FEBS Lett. 196:228-232), 
subtilisin 147 and subtil is in 309 (Hastrup et al. (1989) WO 
89/06279) , subtilisin PB92 (Van Eekelen et al. (1989) BP 
0328229), and subtilisin BLAP (Ladin, B., et al. (1990) 
5 society for Industrial Microbiology Annual Meeting, 
Abstract P60) . The remaining two subtilisin sequences are 
thermitase from the fungus Thermoactinoinyces vulgaris 
(Meloun, B., et al. (1985) FEBS Lett. 183:195-200), and 
proteinase K from the fungus Tritirachium album lOober 
10 (Jany, K.-D., and Mayer, B. (1985) Biol. Chem. Hoppe-Seyler 

366:485-492). 

Methods for obtaining optimum alignment of homologous 
proteins are described in Atlas of Protein Sequence and 
Structure, Vol. 5, Supplement 2 (1976) (Dayhoff , M.O. , ed. , 

15 Natl. Biomed. Res. Found., Silver Springs, MD) . This 
comparison is then used to identify specific amino acid 
alterations which might produce desirable improvements in 
the target enzyme. Wells, J.A., et al. (1987) Proc. Natl. 
Acad Sci. USA 84:1219-1223, used primary sequence 

20 alignment to predict site directed mutations which affect 
the substrate specificity of a subtilisin. Using the 
alignment approach WO 89/06279 teaches the construction of 
mutant subtilisins having improved properties including an 
increased resistance to oxidation, increased proteolytic 

25 activity, and improved washing performance for laundry 

detergent applications. Patent applications WO 89/09819, 
and WO 89/09830 teach improvement in the thermal stability 
of subtilisin BPN' by the introduction of one or more amino 
acid changes based on the alignment of the primary ammo 

30 acid sequences of subtilisin BPN' with the more thermal 
stable subtilisin Carlsberg. From hereon, amino acids will 
be referred to by the one or three letter code as defined 
in Table 1. 
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TABLE 1 

On* and Thr«. L«tt«r Cod* for Aaino Acid* 



A « 


Ala 




Alanine 


C * 


Cys 


m 


Cysteine 


u *^ 


Asp 


as 


Aspartic acid or aspartate 


El 


Glu 




Glutamic acid or glutamate 


F s 

r 






Phenylalanine 








ftl vcine 


n — 


His 
nxo 




Histidine 


T — 

x — 


Tie 




Isoleucine 


K = 


Lys 




Lysine 


L = 


Leu 




Leucine 


M = 


Met 




Metnxonine 










P = 


Pro 




Proline 


Q - 


Gin 




Glutanine 


R = 


Arg 


— 


Arginine 


S = 


Ser 




Serine 


T s= 


Thr 




Threonine 


V ■ 


Val 




Valine 


W = 


Trp 




Tryptophan 


Y « 


Tyr 


s 


Tyrosine 



Rational mutational approaches may also predict 
mutations which improve an enzyme property based upon the 

25 three dimensional structure of an enzyme, in addition to 
the alignment of primary amino acid sequences described 
above. One method for determining the three dimensional 
structure of a protein involves the growing of crystals of 
the protein, followed by X-ray crystal lographic analysis. 

30 This technique has been successfully used to determine 
several high resolution subtilisin structures such as 
thennitase (Teplyakov, A.V., et al. (1990) 214:261-279), 
subtilisin BPN' (Bott, R. , et al. (1988) J. Biol. Chem. 
263:7895-7906) and subtilisin Carlsberg (Bode, W. , et al. 

35 (1986) EMBO J. 5:813-818), for example. 

EP 0251446 teaches the construction of mutant carbonyl 
hydrolases (proteases) which have at least one property 
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different from the parental carbonyl hydrolase. It 
describes nutations which effect (either improve or 
decrease) oxidative stability, substrate specificity, 
catalytic activity, thermal stability, alkaline stability, 
5 pH activity profile, and resistance to . autoproteolysis. 
These mutations were selected for introduction into 
Bacillus amyloliquefaciens subtilisin BPN' after alignment 
of the primary sequences of BPN' and proteases from B, 
subtilis, B. licheniformis , and thermitase. Such alignment 

10 can then be used to select amino acids in these 

other proteases which differ, as substitutes for the 
equivalent amino acid in the B. amyloliquefaciens carbonyl 
hydrolase. This application also describes alignment on 
the basis of a 1.8 A X-ray crystal structure of the B. 

15 amyloliquefaciens protease. Amino acids in the carbonyl 

hydrolase of B. amyloligue/aciens which when altered can 
affect stability, substrate specificity, or catalytic 
efficiency include: Met50, Metl24, and Met222 for 
oxidative stability; Tyrl04, Alal52, Glul5f , Glyl66, 

20 Glyl69, Phel89, and Tyr217 for substrate specificity; N155 
alterations were found to decrease turnover, and lower Km; 
Asp36, Ilel07, Lysl70, Aspl97, Ser204, Lys213, and Met222 
for alkaline stability; and Metl99, and Tyr21 for thermal 
stability. Alteration of other amino acids was found to 

25 affect multiple properties of the protease. Included in 

this category are Ser24, Met50, Aspl56, Glyl66, Glyl69, and 
Tyr217. Substitution at residues Ser24, Met5p, Ilel07, 
G1U156, Glyl66, Glyl69, Ser204, Lys213, Gly215, and Tyr217 
was predicted to increase thermal and alkaline stability. 

30 An important point about this patent application is that 
with the exception of those mutations effecting substrate 
specificity, no rational mutational approach for improving 
the alkaline or temperature stability of a protease based 
upon computer simulations of an X-ray crystal structure is 

35 described. 

WO 88/08028 teaches a method for redesigning proteins 
to increase stability by altering amino acid residues that 
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are in close proximity to the protein's metal ion binding 
site. This application describes the alteration of a 
calcium ion binding site present within subtilisin BPN* 
through the substitution, insertion, or deletion of amino 
5 acid residue (s) in close proximity to that site so that the 
electrostatic attraction between the amino acids and the 
calcium ion is increased. The characterization of the 
calcium ion binding site is accomplished through the 
analysis of a 1.3 A three dimensional structure of 
10 subtilisin BPN 1 using a high resolution computer graphics 
system. This approach allows the selection of amino acids 

acceptable for replacing the native amino acids in the 

# 

protease by first simulating the change using the computer 
model. This allows for the identification of any problems 

15 including steric hindrance prior to the actual construction 
and testing of the mutant proteases. 

US patents 4908773 and 4853871 teach a computer based 
method for evaluating the three dimensional structure of a 
protein to select amino acid residues where the 

20 introduction of a novel disulfide bond will potentially 

stabilize the protein. Potentially acceptable amino acid 
residues can then be ranked, and replaced using computer 
simulation, prior to the actual construction of the mutant 
protein using site directed mutagenesis protocols. 

25 Several patent applications combine published data on 

biochemical stability with computer analysis of three 
dimensional protease structures in order to predict 
mutations which stabilize the enzyme. US 4,914,031 and WO 
88/08033 and WO 87/04461 teach a method for improving the 

30 pH and thermal stability of subtilisin aprA by replacing 
asparagine residues present in asparagine/glycine pairs. 
Asparagine/glycine pairs in proteins have been shown to 
undergo cyclization to form cyclic imide 
anhydroaspartylglycine (Bornstein, P., and Balian, G. 

35 (1977) Methods Enzymol. 47:132-145). This cyclic imide is 

susceptible to base hydrolyzed cleavage leading to 
inactivation of the enzyme. Computer analysis of the three 
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dimensional structure of the apr A protease also predicted 
that formation of the cyclic imide could lead to protease 
inactivation resulting from a shift of the side chain of 
the active site serine. The decision to replace the 
5 asparagine residue and not the glycine residue was based 
upon alignment of the aprA sequence with other Subtilisin- 
like enzymes, cucumisin and proteinase K. 

Sensitivity to oxidation is an important deficiency of 
serine proteases used in detergent applications (Stauffer, 

10 C.E., and Etson, D. (1969) J. Biol. Chem. 244:5333-5338). 

EP 0130756, EP 0247647, and US 4 , 760, 025 teach a saturation 
mutation method where one or multiple mutations are 
introduced into the subtilisin BPN' at amino acid residues 
AS P 32, Asnl55, Tyrl04, Ket222, Glyl66, His64 , Ser221, 

15 Glyl69, G1U156, Ser33, Phel89, Tyr217, and/or Alal52. 

Using this approach mutant proteases exhibiting improved 
oxidative stability, altered substrate specificity, and/or 
altered pH activity profiles are obtained. A method is 
taught in which improved oxidative stability is achieved by 

20 substitution of methionine, cysteine, tryptophan, and 
lysine residues. These publications also teach that 
mutations within the active site region of the protease are 
also most likely to influence activity. Random or selected 
mutations can be introduced into a target gene using the 

25 experimental approach but neither EP 0130756, EP 0247647, 
nor US 4,760,025 teach a method for predicting amino acid 
alterations which will improve the thermal or surfactant 
stability of the protease. 

WO 8705050 teaches a random mutagenesis approach for 

30 construction of subtilisin mutants exhibiting enhanced 
thermal stability. One or more random mutations are 
introduced into single stranded target DNA using the 
chemical mutagens sodium bisulfite, nitrous acid, and 
formic acid. Subsequently, the mutated DNA is transformed 

35 into a Bacillus host and at least 50,000 colonies are 
screened by a filter assay to identify proteases with 
improved properties. Site directed mutagenesis can then be 
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used to introduce all possible nutations into a site 
identified through the random mutagenesis screen. No 
method for pre selection of amino acids to be altered is 
taught. 

5 EP 0328229 teaches the isolation and characterization 

of PB92 subtilisin mutants with improved properties for 
laundry detergent applications based upon wash test 
results. It teaches that biochemical properties are riot 
reliable parameters for predicting enzyme performance in 

10 the wash. Methods for selection of mutations involve the 
substitution of amino acids by other amino acids in the 
same category (polar, nonpolar, aromatic, charged, 
aliphatic, and neutral) , the substitution of polar amino 
acids asparagine and glutamine by charged amino acids, and 

15 increasing the anionic character of the protease at sites 
not involved with the active site. No method for 
identifying which specific amino acids should be altered is 
taught, and no rational mutational approach is taught which 
is based on alignment of X-ray structures of homologous 

20 proteases with different properties. 

EP 0260105 teaches the construction of subtilisin BPN' 
mutants with altered transesterif ication rate/hydrolysis 
rate ratios and nucleophile specificities by changing 
specific amino acid residues within 15 A of the catalytic 

25 triad. Russell, A.J., and Fersht, A.R. (1987) Nature 
328:496-500, and Russell, A.J., et al. (1987) J. Mol. Biol. 
193:803-813, teach the isolation of a subtilisin BPN' 
mutant (D099S) that had a change in the surface charge 14- 
15 A from the active site. This substitution causes an 

30 effect on the pH dependence of the subtilisin's catalytic 
reaction. 

There are a number of different strategies for 
increasing protein stability. Many of these methods suggest 
types of substitutions to improve the stability of a 
35 protein but do not teach a method for identifying amino 
acid residues within a protein which should be substituted. 
From entropic arguments, many types of substitutions have 
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been suggested such as Cly to Ala and any amino acid to Pro 
. (Matthews, B.W., et al. (1987) Proc. Natl. Acad. Scl. 
84:6663-6667). Likewise, while it is clear that increasing 
the apolar size of an anino acid in the core will add to 
stability, adverse packing effects nay sore than conpensate 
for the hydrophobic effect, resulting in a decrease in 
protein stability (Sandberg,W.S. , and Terwilliger, T.C. 
(1989) Science 245:54-57). Henendez-Arias, L., and Argos, 
P. (1990) J. Mol. Biol. 206:397-406, performed a 
statistical evaluation of anino acid substitutions of 
thermophilic and mesophilic molecules and proposed that 
decreased flexibility and increased hydrophobicity in the 
o-helical regions contributes most towards increasing 
protein stability. From their data, they formulated a set 
15 of empirical rules to improve stability. 

Increasing the hydrophobicity of certain side chains 
has long been suggested as a means to improve protein 
stability. The hydrophobic exclusion of nonpolar amino 
acids is the largest force driving protein folding. This 
has been studied by examining the partitioning of amino 
acids or amino acid analogs from water to a hydrophobic 
medium. While the numbers vary depending on the work, 
these studies generally agree that burying a hydrophobic 
side chain increases protein stability. For example, 
Kellis, J.T., Jr., et al. (1988) Nature 333:784-786, 
estimated that the removal of a methyl group destabilizes 
the enzyme by 1.1 kcal/mole assuming no other structural 
perturbations occur. Conversely, this predicts that the 
addition of a methylene group should add 1.1 kcal/mOl if no 
unfavorable contacts occur. Similarly, Sandberg, W.S., and 
Terwilliger, T.C. (1989) Science 245:54-57, showed that the 
effect of removing or adding methylene groups is the sum of 
the hydrophobic effect and structural distortions. Simply 
adding buried hydrophobic groups may not increase protein 
stability because the total effect of adding or deleting a 
methyl group on the local packing structure must be 
considered. As the protein interior has a para-crystalline 
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structure (Chothia, C. (1975) Nature 254:304-308) , email 
distortions in the remainder of the structure resulting 
from the addition methyl group may exact a high cost and 
reduce rather than increase stability. 
5 Along the same lines, the core of X repressor has been 

shown to be amazingly tolerant to apolar amino acid 
substitutions in a functional assay (Bowie, J.U., et al. 

(1990) Science 247:1306-1310). It is not clear that thfs 
is true for larger proteins. The constraints on the 

10 hydrophobic core of a small protein may be less stringent 
than a larger protein simply due to the volume of the core 
relative to the number of amino acids which need to pack 
into the region. As the volume of the hydrophobic core 
increases, the number of amino acids which must pack 

15 together correctly increases, requiring more specific 
nonlocal interactions. 

It has been recognized that increasing the interior 
hydrophobicity of a protein as a means of increasing the 
stability is hampered by the difficulty of determining 

20 which positions in the protein will lead to stabilization 
when substituted (Sandberg, W.S., and Terwilliger, T.C 

(1991) Trends Biotechnol. 9:59-63). The methods discussed 
above provide a means of determining what substitutions to 
make to improve stability but do not identify which sites 

25 in the protein are most important. The present invention 
provides a method of determining which positions in the 
protein will lead to stabilization when substituted. 

SUMMARY 07 THE INVENTION 
Other than in the operating examples, or where 
30 otherwise indicated, all numbers expressing quantities of 
ingredients or reaction conditions used herein are to be 
understood as modified in all instances by the term 
"about". 

The native or wild-type protease from which the mutant 
35 proteases according to the invention are derived is a B. 

ientus alkaline protease (BLAP) obtained from B. lentus 
DSM 5483 having 269 amino acid residues, a molecular mass 
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of 26,823 daltons, and a calculated isoelectric point of 
9.7 based on standard pK values. The BLAP gene is obtained 
by isolating the chromosomal DNA from the fl. lentus strain 
DSM 5483, constructing DNA probes having homology to 
5 putative DNA sequences encoding regions of the B. lentus 
protease, preparing genomic libraries from the isolated 
chromosomal DNA, and screening the libraries for the gene 
of interest by hybridization to the probes. 

Mutant B . lentus DSM 5483 proteases have been made 

10 which are derived by the replacement of at least one amino 
acid residue of the mature form of the B. lentus DSM 5483 
alkaline protease. The sites for replacement are selected 
from the group consisting of Ser3, Val4, Ser36, Asn42, 
Ala47, Thr56, Thr69, G1U87, Ala96, AlalOl, Ilel02, Serl04, 

15 Asnll4, Hisll8, Alal20, Serl30, Serl39, Thrl41, Serl42, 
Serl57, Alal88, Vall93, Vall99 f Gly205, Ala224, Lys229, 
Ser236, Asn237, Asn242, His243, Asn255, Thr268. The 
replacement amino acid residues are listed in Table 2. The 
numbering of the mutant proteases is based on the 

20 B. lentus DSM 5483 wild-type protease as given in the SEQ 
ID NO:52. 

Genes which express the mutant B. lentus DSM 5483 
proteases according to the invention are made by altering 
one or more codons of the wild-type B. lentus DSM 5483 

25 alkaline protease gene which encode for a protease derived 
by accomplishing at least one of the amino acid 
substitutions listed in Table 2 . 

The protease sites listed in Table 2 are sites 
predicted to affect thermal and surfactant stability 

30 relative to the wild-type protease. These sites are 
identified by means of a computer based method which 
compares the three dimensional structure of the wild-type 
protease (henceforth, the target protein) and a homologous 
protease (henceforth, the reference protein). The three 

35 dimensional coordinates of the wild-type protease are 
probed with an uncharged probe molecule to produce a probe- 
accessible surface which has an external surface the 
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interior of which contains one or »ore probe-accessible 
internal cavities. The amino acids of the reference protein 
having side chains lying outside the solvent-accessible 
surface or inside the internal cavities of the target 
5 protein are identified by aligning the three dimensional 
coordinates of the target protein and the reference 
protein. 

Proteins having greater thermal and surfactant 
stability are produced by replacing the amino acid in the 
10 target protein if the amino acid in the target protein can 
be changed without creating unacceptable steric effects. 
The amino acid in the target protein is altered by site 
directed mutagenesis of the gene which expresses the target 
protein. 

15 Genetic constructs are made which contain in the 

direction of transcription a promoter, ribosomal binding 
site, initiation codon and the major portion of the pre 
region of the Bacillus licheniformis ATCC 53926 alkaline 
protease gene operably linked to a portion of the pre 

20 region and all of the pro and mature regions of the 
Bacillus lentus DSM 5483 alkaline protease gene followed by 
a 164 bp DNA fragment containing the transcription 
terminator from the ATCC 53926 alkaline protease gene. The 
Bacillus lentus DSM 5483 alkaline protease gene is altered 

25 to produce a mutant gene which encodes for a protease 
derived by accomplishing at least one of the amino acid 
substitutions listed in Table 2. Mutant protease is made by 
fermenting a BaciJlus strain transformed with a genetic 
construct containing a mutated Bacillus lentus DSM 5483 

30 alkaline protease gene. 

BRIE? DESCRIPTION OP THE PIGORBS 
Figure 1 shows the atomic coordinates for Bacillus 
lentus alkaline protease (BLAP) to 1.4 A resolution. 

Figure 2 shows the restriction map for plasmid pCB13C 
35 which contains a hybrid gene fusion between the Bacillus 
licheniformis ATCC 53926 protease gene and the Bacillus 
lentus DSM 5483 BIAP gene. The promoter, ribosomal binding 
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site and presequence (P-53926) from ATCC 53926 were fused 
to the pro- and mature sequence of the BLAP gene. The 
transcription terminator of ATCC 53926 (T-53926) was 
appended to the BLAP coding region. 
5 Figure 3 shows the restriction map for plasmid pMcl3C 

which is derived from P Mac5-8 and contains the BLAP gene 
and carries an amber mutation in the Ap* gene which renders 

it inactive. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

10 one aspect of the invention relates to mutant 

proteolytic enzymes which have superior thermal stability 
and surfactant stability relative to the wild-type protease 
as determined by laboratory tests. The mutant ' proteases 
according to the invention are those derived by the 

15 replacement of at least one amino acid residue of the 
mature Bacillus lentus DSM 5483 alkaline protease wherein 
said one amino acid residue which is selected from the 
group consisting of Ser3, Val4 f Ser36, Asn42, Ala47, Thr56 f 
Thr69, Glu87, Ala96, AlalOl, Ilel02, Serl04, Asnll4, 

20 HisllS, Alal20, Serl30, Serl39, Thrl41, Serl42, Serl57, 
Alal88, Vall93, Vall99, Gly205, Ala224, Lys229, Ser236, 
Asn237, Asn242, His243, Asn255, Thr268 is replaced with the 
amino acid residues listed in Table 2. Table 2 shows the 
identity and position of the wild-type amino acid and the 

25 amino acid residue(s) which replace . it in the mutant 
protein. For example, the first entry in Table 2 shows 
Ser3, a serine residue at position 3 which can be replaced 
by threonine (abbreviated as T using the one letter code 
for amino acids) or any small amino acid. A small amino 

30 acid is defined as glycine, alanine, valine, serine, 
threonine or cysteine. A small hydrophobic amino acid is 
defined as glycine, alanine, threonine, valine or 
isoleucine. A charged amino acid is defined as lysine, 
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arginine, histidine, glutanate or aspartate, 
abbreviation a. a. stands for "amino acid" residue. 

TABLE 2 



Residue 


Replacement Amino ^Acld 


Ser3 


T 


or 


any small, hydrophobic a. a. 


Val4 


I. 


p s 


or any small a. a. 




Ai 


t T 


or any small a. a. 


Del H £ 


F < 
* i 


, A 


. t. v. i. y 


nidi / 


W 




anv small a«a* except A 




v 


c 

w 


or anv small, hydrophobic a. a. 

^ %H mj mm mm m J ^ mmmm m*- m m> wmw mm* w w 


inroy 


p 




or anv charaed a. a. 


blUO / 


p 


M 


or anv charaed d«a« 


AX97D 


T 


u 

"i 


^ at* anv small, hvdroohobic a. 


Aiaiui 


• # 


c 


or anv ^mali hvdroDhobic a. a. 


Iiei02 


ri 


or 


anv cma 11 a a excfiDt P 
any small a « a • ^^.wc^w * 


seriu4 


rp 
1 


or 


anv email hvHynnhnbi c a. a. 
any small, nyuivpiiuwiw a* a* 


Asnll4 


e 


Q 


Oi einjr small ; iiy ui u^iiuuxu a* a* 


H1S118 


r 


or 


any a. a. except P and W 


Aiai20 


V 


or 


any small, hydrophobic a. a. 


seri30 


A, 


T 


or any small, hydrophobic a. a. 


sen 


A 


T| 


, y or any a. a. except P and W 


Tnri4i 


u 


or 


any a. a. except P 


Serl42 


* 

A, 


T 


or any small, hydrophobic a. a. 


Serl57 


T 


or 


any small, hydrophobic a. a. 


Alal88 


r 


or 


any small, hydrophobic a. a. 


Vall93 


M 

n 


or 


any small , hydrophobic a. a. 


Vail99 


I 


or any small, hydrophobic a. a. 


Gly205 


V 


or 


any small, hydrophobic a. a. 


Ala224 


V 


or 


any small, hydrophobic a. a. 


Lys229 


W 


or 


any a. a. except P 


Ser236 


A, 


T 


or any small, hydrophobic a. a. 


Asn237 


A, 


Hf 


Q, M or any small, hydrophobic 


Asn242 


A, 


N, 


Q, M or any small, hydrophobic 


His243 


A, 


N, 


Q, M or any small, hydrophobic 


Asn255 


P 


or 


any small, hydrophobic a. a. 


Thr268 


V 


or 


any small, hydrophobic a. a. 
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The amino acid sequences of the preferred proteolytic 
enzymes are given in SEQ ID N0:1 to SEQ ID NO: 51. The 
preferred mutated B. lentus DSM 5483 proteases which are 
encoded for by genes according to the invention as 
disclosed above are given in SEQ 10 NO: 53 to 105. These 
proteases are produced by bacterial strains which have been 
transformed with plasmids containing a native or hybrid 
gene, mutated at one or more nucleotide base pairs by known 
mutagenesis methods. These mutant genes encode for 
proteases in which selected amino acid residues have been 
substituted for by other amino acids. 

The mutant proteases according to the invention are 

listed in Table 3. 

Table 3 

Mutation Temperature stability Sp? stabjUtV 

50 # C, 60»C, pH 10.5, pH 8.6, 

pH 11.0 pH 10.0 50«C S0»C 
t>j (min) tij (min) t% (rain) tJj(min) 



20 








3.2 


12 




S3T, V4I, A188P, V193K, V199I 


120 


67 




S3T, A188P, V193M, V199I 


95 


60 


3.75 


18.5 




V4I, A188P, V193K, V199I 


72 


39 


1.75 


3.75 




S139Y, A188P, V193M, V199I 


69 


■ 33 


1.4 


4.6 


25 


S130T, S139Y, A188P, V193H, V199I 


64 


22 


2 


6.3 


A188P, V193M, V199I 


55 


23.5 


3.0 


12.5 




S3T, A188P, V193M 


54 


21 


1.5 


3.4 




S157T 


52 


17.5 


1.2 


0.95 




A188P, V193M 


50 


27 


2.5 


7.25 


30 


A188P 


48 


19 


1.4 


2.8 




S3T, V4I, A188P, V193M 


43 


21 


1.4 


3.7 




V193M 


42 


16.6 


1.2 


3.0 




S104T 


42 


8 


1.0 


1.8 








1.8 




T69V 


41 


12.3 


0.8 


35 


V4I, A188P, V193M 


40 


19 


1.25 


2.7 




A224V 


39 


15 


0.9 


1.1 




V199I 


38.5 


11.6 


1.0 


2.0 




V4I 


32.5 


10 


0.75 


1.0 




S3T 


32 


6.6 


1.2 


2.8 


40 


S139Y 


26 


8.8 


1.0 


2.0 




N242A 


26 


7.4 


0.9 


1.9 




S236T 


25.5 


8.4 


1.0 


2.0 




S36A 


23.8 


8.6 


0.9 


1.8 
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TABLE 4 (cont.) 



Mutation Temperature Stability SOS Stability 

50 # C, 60 # C, pH 10.5, pH 8.6, 
pH 11.0 pH 10.0 50 # C 50*C 
tH (min) th (min) th (min) t% (min) 



H243A 


23 
*<# 


5.9 


0*8 


1.7 


A101T 


93 
d J 


4.7 


V • * 


2.75 


S236A 


2 J 




A ft 


1.3 


B87R 


22 .5 


7 • U 


A A 

e 


1 2 


N114S 


22 


7 • 7 


i i 

A . A 


1.3 


A47W 


21 


9 « d 


O Of 
V . 7 


1 Q5 

A . w w 


A120S 


20. 5 


a >t 

8.4 


U. 7 


l . e 


T56V 


20 


a c 

8.5 


A 0. 


■ A *7 
V. 7 


A120V 


20 


11.8 


0. ©5 


1.7 


6205V 


20 


6.8 


1.1 


i a 

2.8 


S130A 


20 


8.8 


0.4 


1.0 


S130T 


20 


7.2 


0.4 


« a 
1.1 


A96I 


19 


12 


1.0 


1.4 


S104T, S139Y, A224V 


n o 
lo 


7.3 




1 ft 


S139A 


18.5 


7.8 


0.5 


0.8 


S142T 


17.5 


11.5 


0.9 


1.7 


S139T 


16.5 


4.3 


0.5 


0.8 


I102W 


16.5 


7.2 


0.7 


1.6 


A96N 


18 


6 


0.9 


0.95 


N42F 


18 


5.9 


1.0 


1.4 


S142A 


16 


9 


1.0 


1.7 


H118P 


15.8 


5.1 


1.0 


1.3 


N237A 


15 


7.6 


0.67 


1.3 


N255P 


15.0 


5.3 


1.2 


1.25 


T141W, N237A 


14 


5.4 


0.33 


1.1 


T268V 


14 


3.8 


0.75 


1.1 


K229W 


13.4 


4.6 


1.0 


1.4 


T141W 


12 


6.5 


0.6 


1.4 


wildtype 


12.0 


3.0 


0.8 


1.6 



30 

Any of the proteases listed in Table 3 will exhibit greater 
stability in some manner than the wild-type protease BLAP. 
The entries under the "Mutation" heading of Table 3 shows 
the identity of the wild-type amino acid (using the one 
35 letter code), its position, and the amino acid which 
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replaces it in the mutant protease. For example, S3T 
signifies that the serine at position 3 of the nature 
protease is replaced with a threonine. Some of the 
preferred mutant proteases are single replacements at 
5 specific locations such as a protease wherein valine at 
position 4 is replaced by isoleucine to specific 
combinations of replacements such as a protease wherein 
threonine at position 141 is replaced by tryptophan and 
asparagine at position 237 is replaced by alanine. The 

10 latter protease containing two replacements is one of only 
a number of possibilities. 

The preferred mutant proteases according to the 
invention are identified as: (S3T, V4X, A188P, V193M, 
V199I); E87R; (S3T, A188P, V193M, V199I) ; N114S; (V4I, 

15 A188P, V193H, V199I) ; A47W; (S139Y, A188P, V193H, V199I) ; 

A120S; (S130T, S139Y, A188P, V193M, V199I) ; T56V; 
A120V; (A188P, V193M, V199I) ; G205V; (S3T, A188P, V193M) ; 
S130A; S130T; S157T; A96I; (S104T, S139Y, A224V) ; S139A; 
S142T; S139T; I102W; V193M; A96N; N42F; S142A; H118F; 

20 N237A; N255P; (T141W, N237A) ; T268V; K229W; T141W; (A188P, 

V193M); V4I; S3T; S139Y; N242A; S236T; S36A; H243A; A101T; 
S236A; A188P; (S3T, V4I, A188P, V193M) ; V193M; S104T; T69V; 
(V4l/ A188P, V193M); A224V; V199I. The system used to 
designate the above preferred proteases first lists the 

25 amino acid residue in the mature form of the B. lentus DSM 
5483 alkaline protease at the numbered position followed by 
the replacement amino acid residue using the one letter 
codes for amino acids. For example, V193M is a protease in 
which valine has been replaced by methionine at position 

30 193 of the mature B. lentus DSM 5483 alkaline protease. A 
mutant protease identified by more than one such 
designation is a mutant protease which contains all of the 
indicated substitutions. For example, (A188P, V193M) is a 
protease in which valine has been replaced by methionine at 

35 position 193 of the mature B. lentus DSM 5483 protease and 
alanine at position 188 has been replaced by proline. 
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Mutant forms of the B. lentus DSM 5483 alkaline 
protease are prepared by site-specific mutagenesis of DMA 
encoding the mature form of either wild-type BLAP, or a 
mutant BIAP* The DNA fragment encoding the mature form of 
5 wild type BLAP was prepared using plasmid pCB13C. Plasmid 
PCB13C contains a hybrid fusion between the 
B. licheniformis ATCC 53926 protease gene and the B. lentus 
DSK 5483 BLAP gene, shown in Figure 2. Specifically, this 
hybrid fusion contains DNA encoding the promoter, ribosomal 

10 binding site, and 21 residues of the pre sequence from the 
ATCC 53926 protease gene fused to a DNA sequence encoding 
the last five residues of the BLAP pre sequence and all of 
the pro and mature residues of BLAP. This fusion is 
referred to as the Cial fusion because this restriction 

15 site is located at the juncture between the ATCC 53926 and 
DSM 5483 DNA 1 s • A new Clal restriction site had to be 
introduced into the ATCC 53926 alkaline protease gene near 
to the junction of the pre and pro sequences. The CJal 
site was introduced into the ATCC 53926 alkaline protease 

20 gene by using a polymerase chain reaction (PCR) to amplify 

a DNA fragment containing sequence information from the N- 
terminal part of the ATCC 53926 alkaline protease gene. 
The amplified fragment included the ATCC 53926 alkaline 
protease promoter, ribosomal binding site, initiation 

25 codon, and most of the pre sequence.. This 292 bp DNA 

fragment was flanked by Aval and clal restriction sites at 
its 5 f and 3 1 ends, respectively. The BLAP gene already 
contained a naturally occurring Clal site at the 
corresponding position. Analysis of the DNA sequence 

30 across the fusion of the ATCC 53926 and BLAP genes 
confirmed the expected DNA and amino acid sequences. 

Before any mutagenesis can be carried out, the gene is 
subcloned into the mutagenesis vector pMa5-8. This is 
accomplished by synthesizing a DNA fragment containing the 

35 Clal fusion gene and the ATCC 53926 transcription 
terminator as a Sail cassette using the PCR. The PCR was 
carried out using conditions as described by the 
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manufacturer (Perkin Elmer Cetus, NorwalX, CT.). In the 
PCR, two synthetic oligonucleotides bearing Sail sites are 
used as primers and Escherichia coli vector pCB13C DNA as 
a template. After cutting the PCR product with Sail, this 
5 fragment is cloned into the mutagenic plasmid pMc5-8 which 
has previously been cut with Sail and dephosphorylated with 
bacterial alkaline phosphatase . Plasmids pMc5-8 , and pMa5- 
8 described below were obtained from H.-J. Frits and are 
described by Stanssens, P., et al. (1989) Nucleic Acids 

10 Res. 17:4441-4454. Sail sites are chosen to allow the PCR 
fragment to be cloned into pMc5-8 in both orientations. The 
ligation mix is transformed into E. coli WK6. 
Chloramphenicol resistant (Cm R ) transformants are screened 
for the presence of an insert and a correct plasmid 

15 construct P Mcl3C is identified as shown in Figure 3. Once 
the gene is cloned into the pMc vector and desirable sites 
for mutation are identified, the mutation (s) is introduced 
using synthetic DNA oligonucleotides according to a 
modification of a published protocol (Stanssens, P., et al. 

20 (1989) Nucleic Acids Res. 17:4441-4454). The 

oligonucleotide containing the nutation (s) to be introduced 
is annealed to a gapped duplex (gd) structure which carries 
the BLAP gene on a segment of single stranded (ss) DNA. 
The gapped duplex can be formed by annealing linear ss DNA 

25 from pMcl3C with denatured and restricted pMa5-8 DNA. 

Plasmid pMa5-8 contains an active ampicillin resistance 
gene but has an inactivating point mutation in the 
chloramphenicol resistance gene, whereas plasmid pMcl3C 
contains, in addition to an intact BLAP gene, an active 

30 chloramphenicol resistance gene, but has an inactivating 
point mutation in the ampicillin resistance gene. The 
annealed product is the gd DNA which is a double stranded 
heteroduplex with a ss DNA gap spanning the entire cloned 
BLAP gene. The mutant oligonucleotide is able to anneal to 

35 homologous ss BLAP DNA within the gap and the remaining gap 
is filled in by DNA polymerase I (Klenow fragment) and 
ligated using T4 DNA ligase, purchased from New England 
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Biolabs Inc., Beverly, Ma. The mutagenic efficiency of 
such a system can be improved by the use of Exonuclease III 
(Exo III) purchased from New England Biolabs Inc., Beverly, 
MA. Exo III is an exodeoxyribonuclease that digests 
5 double stranded DMA from the 3' end. As a free 3 • end is 
required, closed circular ss DNA or ds DNA is unaffected by 
this enzyme. A subsequent treatment of the product of the 
fill-in reaction with Exo III removes any species with only 
partially filled gaps. This significantly improves the 

10 mutagenic efficiency and is the preferred mutagenesis 
method. The product of the fill-in reaction is then 
transformed into a repair deficient E. coli strain such as 
WK6mutS and ampicillin resistant transf ormants (Ap R ) are 
selected. Replication of the transformed heteroduplex 

15 phasmid results in two different progenies. One progeny 
contains the wild type BLAP gene and the intact 
chloramphenicol resistance gene, but an inactive ampicillin 
resistance gene. The other progeny contains a BLAP gene 
carrying the mutation of interest and is resistant to 

20 ampicillin but not to chloramphenicol. 

Selection of Ap R , Cm 5 mutant transformants with 
ampicillin is not sufficient to stop some background growth 
of the Ap s , Cm R progeny carrying the wild type BLAP gene. 
Therefore, it is necessary to perform a second 

25 transformation into E* coli using plasmid DNA prepared from 
the Ap R transf ormants of the WK6mutS strain. This second 
transformation uses a low plasmid concentration with a 
large number of recipient cells of a suppressor deficient 
strain of E. coli such as WK6. This approach decreases the 

30 likelihood of a recipient cell receiving plasmid DNA from 
both progeny. Ap* transf ormants are selected and plasmid 
DNA from several transf ormants is isolated and screened for 
the presence of the mutation. The pMa mutant derivative of 
the first mutagenesis round can be used for a second round 
35 of mutagenesis by preparing ss DNA of that species and 
annealing it to Xbal/Hindlll restricted and denatured DNA 
of pMc5-8. Plasmid pMc5-8 is identical to pMaS-8 except 
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that it contains an active chloramphenicol resistance gene 
and an inactive ampicillin resistance gene. The general 
procedure is the sane as that described above. 

The mutant BLAP proteases can be produced by 
5 transferring the mutant BLAP genes from their particular E. 
coli pMal3C derivative vector into a plasmid vector which 
can replicate in Bacillus. To accomplish this, the mutant 
BLAP genes are separated from their pMal3C plasmids by 
digestion with the restriction endonucleases Aval and SstI, 

10 followed by ligation to the larger Aval/Sstl fragment from 
either plasmid pH70 or pC51. These Aval/Sstl fragments 
from pH70 and pC51 include the DNA sequences necessary for 
replication in Bacillus and encode either ' kanamycin 
resistance (Km R ) or tetracycline resistance (Tc*) , 

15 respectively. Plasmid pH70 is constructed by cloning the 
ATCC 53926 alkaline protease gene carried on a BcoRI/BamHI 
DMA fragment into the Km* plasmid pOBHO between the ScoRl 
and BamHI sites. Plasmid pC51 is constructed by cloning 
the ATCC 53926 protease gene carried on a £coRI-BamRT 

20 fragment into the Tc R plasmid pBC16 between the EcoKL and 
BamHI sites. The larger Aval-SstI fragment from either 
pH70 or pC51 used for cloning the mutant BLAP genes is 
first purified from other DNA fragments by high pressure 
liquid chromatography (HPLC) on a Gen-Pak PAX column 

25 (Maters, Milford, MA) . The column is 4.6 mm by 100 mm in 

size and contains a polymer-based high performance anion- 
exchange resin. Conditions for elution of the DNA are a 
flow rate of 0.75 ml/min with a gradient of Buffer A (25 
mM tris(hydroxymethyl)aminomethane (Tris) pH 8 . 0 containing 

30 1 mM disodium ethylenediamine tetraacetic acid (EDTA) ) and 
Buffer B (25 mM Tris pH 8.0, 1 mM EDTA, 1 M NaCl) starting 
at 50% each and reaching a final concentration of 30% 
Buffer A and 70% Buffer B. 

After ligation the mutant BLAP plasmids are 

35 transformed into B. subtilis DB104. The genes encoding 
the major alkaline and neutral proteases present in this 
strain have been inactivated (Kawamura, P., and Doi, R.A. 
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(1984) J. Bacterid. 160:442-444). Cell* of B. subtilia 
DB104 transformed by these plasmids grow on a nutrient-ski* 
■ilk agar in the presence of either kanamycin or 
tetracycline. Transformants of DB104 that manufacture 
5 mutant protease are identified by the formation of clear 
zones of hydrolysis in the skim milk. Confirmation that the 
protease^producing transf ormants carry a plasmid-borne BIAP 
gene with the desired mutation (s) is accomplished by 
purifying plasmid DNA from a culture of each transf ormant . 

10 The plasmid DNA is purified away from cell protein and 
chromosomal DNA by SDS-salt precipitation followed by 
chromatography over a Qiagen ion-exchange column (Qiagen 
Corporation, Studio City, CA) . Aval-SstI digested plasmid 
DNAs from different transf ormants are compared with 

15 Aval/Sstl-digested derivatives of plasmid pH70 or pC51 
known to carry an intact BIAP gene. Restriction digests of 
these plasmids are compared by agarose gel electrophoresis 
to identify plasmids that have the proper-sized Aval/SstI 
DNA fragments. Selected plasmid DNAs are then seguenced 

20 across the region of the expected BIAP mutation (s) to 

confirm that the desired mutation (s) are present. One or 
more clones of each BIAP mutation are stored frozen in 15% 
glycerol at -70°C and also cultivated in shake flasks 
(Example 4, Production of Proteases) to produce mutant 

25 protease for characterization. 

Another aspect of the invention provides a computer 
based method for identifying the sites which affect the 
storage, thermal, SDS and pH stability of a protein. This 
method is based on the hypothesis that protein stability 

30 may be enhanced by decreasing the volume of internal 
cavities and improving surface packing of amino acid side 
chains. The interior of a protein contains many apolar 
amino acids which are tightly packed into a nearly 
crystalline state. One way in which these interior amino 

35 acids affect protein stability is through packing effects. 

These include van der Waal interactions, distortion of the 
remainder of the protein and electrostatic effects. 



SUBSTITUTE SHEET 



10 



15 



WO 92/21760 PCTAJS92/04306 

24 

Packing affects have been studied by measuring the 
contribution of .ethyl groups in the interior of a protein 
to the overall stability of the protein. It ha. been 
estimated that the removal of a methyl group fro. the 
5 interior of a protein destabilizes it by about 1.1 kcal/mol 
assuming no other perturbations occur (Kellis, J.T., Jr., 
et al. (1988) Nature 333:784-786). However, the inverse may 
not be true. Simply adding buried hydrophobic groups may 
not increase protein stability because the total effect of 
adding or deleting a methyl group on the local packing 
structure must be considered. As the protein interior has 
a para-crystalline structure (Chothia, C. (1975) Nature 
254:304-308), small distortions in the remainder of the 
structure resulting from the addition methyl group may 
exact a high cost and reduce rather than increase 
stability. 

While it is known in the art to make certain 
substitutions which may affect protein stability, there is 
no known way of identifying which sites in the protein will 
lead to stabilization when substituted. For example, it 
has been suggested that protein stability would be 
increased if alanine were substituted for glycine or 
serine; or if threonine were substituted for serine 
(Matthews, B.W., et al. (1987) Proc. Natl. Acad. Sci. 
84:6663-6667); or if proline were substituted for glycine. 
However, the sites in which one or more of these 
substitutions should be made has been so far unpredictable. 
Other methods depend on comparisons of the amino acid 
sequences of different but related proteins. However, this 
does not show which sites are important to stability, only 
which positions are different. 

There are two computer based methods for identifying 
the sites which affect the stability of a protein according 

to the invention. 

in the first method for identifying sites which affect 
the stability of protein, the first step comprises 
generating a probe-accessible surface by analyzing the 
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target protein coordinates with an uncharged probe molecule 
having a radius of about 0.9 to about 2.0 A. It is 
important that no water molecules be Included in the 
protein structure during this analysis. The second step of 
5 this method is the identification of the amino acids which 
form the boundaries of the internal cavities. These amino 
acids comprise a set of positions which, if mutated,, may 
increase the stability of the protein. An increase in 
stability can be achieved by amino acid substitutions which 

10 decrease the volume of the internal cavities. 

The molecular modeling program QUANTA (trademark of 
Polygen Corporation, 200 Fifth Ave., Waltham, MA 02254) was 
used to calculate probe-accessible surfaces as' well as 
perform the alignment of the three dimensional coordinates 

15 of the proteins. These functions can be carried out 
equally well by other molecular modeling programs which are 
also commercially available. The following is a list of 
commercially available programs which can also be used to 
calculate probe-accessible surfaces: Insight or Insightll 

20 (trademark of Biosym Technologies, Inc., 10065 Barnes 

Canyon Road - Suite A, San Diego, CA 92121), BIOGRAF 
(trademark of Biodesign, Inc., 199 S. Los Robles Ave., 
#270, Pasadena, CA 91101) or Sybyl (trademark of Tripos 
Associates, 1699 S. Hanley Road, St. Louis, MO 63144) 

25 The probe-accessible surface referred to in step 1 of 

the first method can be generated in several ways 
(Richards, P.M. (1977) Annu. Rev. Biophys. Bioeng. 6:151- 
176): A spherical probe of radius R (0.9 to 2.0 A) is 
allowed to roll on the outside of a molecule while 

30 maintaining contact with the van der Waal surface. The 
surface defined by^the center of the probe is defined as 
the probe-accessible surface. Alternatively, a similar 
surface can be generated by increasing the van der Waal 
radii of all the atoms in a protein by the radius of the 

35 probe. Overlapping surfaces are eliminated and the 
remaining surface represents the probe-accessible surface. 
In the preferred embodiment, a three-dimensional box Of 



SUBSTITUTE SHEET 



WO 92/21760 PCT/US92/04306 

h 

26 

dimensions 50x50x50 A with a 1 A grid size in all threa 
dimensions (x, y, and z) is centered on the center of mass 
of the target protein coordinates. Most preferrably, the 
dimensions of the probe map are adjusted such that all of 
5 the protein atoms fall within the probe map's bounds. The 
grid size of 1 A provides a sufficiently high resolution to 
clearly define the probe-accessible surface although 
another grid size could be used, ranging from 0.5 to 3.0 A. 
An uncharged probe molecule is positioned at each grid 
10 point and the energy of interaction between the probe and 
the target protein atoms is determined. The energy of 
nonbonded interaction (E^) contains only the van der Waal 
component such that 

XQOATXCm (1) 

15 where r is the nonbonded distance, e i3 is the dispersion 
well depth and cr 13 is the Lennard-Jones diameter. The 
result is a nap consisting of a box with energy values at 
each grid point. This map can be contoured at a particular 
energy value to generate surfaces which correspond to the 

20 solvent accessible surface and internal cavities (Good ford, 
P.J. (1985) J. Med. Chem. 28? 49-857). The value at which 
to contour the maps can var; depending on the particular 
radius used and the parameters used to define the probe 
molecule and the particular method used to generate the 

25 probe. The preferred embodiment is to used a probe radius 

of 0.9 A and contour the surface at 10 kcal/mol. 

The external surface of the probe-accessible surface 
is also known as the solvent-accessible surface. Probe- 
accessible surfaces inside of the solvent accessible 

30 surface are defined as internal cavities and represent 
cavities large enough to accomnodate a molecule with a 
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radius equal to the probe radius. The presence of such e 
cavity on the inside of a protein does not imply that the 
cavity will in fact be filled by one or more solvent 
molecules. 

5 The second step of the method for identifying sites 

which affect the stability of a protein is the 
identification of the amino acids which form the internal 
cavities. The internal cavities are defined by the amino 
acids which make up its boundaries. These amino acids 

10 comprise a set of positions which, if mutated, may increase 
the stability of the protein. 

in a second method for identifying sites which affect 
the stability of a protein , the first step comprises 
generating a probe-accessible surface by analyzing the 

15 target protein coordinates with an uncharged probe molecule 
having a radius of about 0.9 to about 2.0 A. It is 
important that no water molecules be included in the 
protein structure during this analysis. This step is the 
same as the first step of the method set forth above. 

20 The second step involves aligning the three 

dimensional structure of the target protein and a reference 
protein by moving the three dimensional coordinates of the 
reference protein into the coordinate frame of the target 
protein. The reference protein is usually chosen so that a 

25 high degree of similarity exists between it and the target 
protein so that packing differences between the target and 
reference protein which potentially affect the stability of 
the target protein can be identified. The reference protein 
can be any protein for which a three dimensional structure 

30 is available which is homologous to the target protein. 

Examples of such proetins include but are not limited to 
subtilisin Carlsberg, subtilisin BPN' , proteinase K, and 
Thermitase. When the target protein is BLAP, one 

preferred reference protein is Thermitase. Thermitase is an 

35 extra-cellular subtilisin-like serine protease isolated 
from Thermoactinomyces vulgaris (Frommel, C, et al. (1978) 
Acta Biol. Med. Ger. 37:1193-1204) . The protein amino acid 
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sequence of thermitase is 42% identical to BIAP. The high 
degree ot similarity between these two proteins provides an 
ideal system with which to examine packing differences that 
affect BIAP stability. In this second step the three 
5 dimensional structures of Thermitase and BIAP are aligned 
using the computer program QUANTA™. The three dimensional 
alignment is carried but by first aligning the primary 
sequences of the two proteins to determine which amino 
acids are equivalent. This is accomplished using FASTA 

10 (Myers, E.W. , and Miller, W. (1988) Comput. Applic. Biosci. 
4:11-17; Pearson, W.R., and Lipman, D.J. (1988) Proc. Natl. 
Acad. Sci. USA 85:2444-2448). Based on this alignment of 
the primary sequence, residues are matched for subsequent 
alignment of the three dimensional structures using MULTLSQ 

15 (Sutcliffe, H.J., et al. (1987) Protein Eng. 1:377-384; 

Kabsch, W. (1976) Acta Cryst. A32:922-923) . This program 
uses one structure as fixed coordinates (the target protein 
coordinates) and then rotates and translates a second 
structure (the reference protein coordinates) so as to give 

20 the smallest root mean squared (r.m.s.) deviation between 
the two sets of three dimensional coordinates. For 
example, the alignment of the BIAP and thermitase three 
dimensional coordinates results in an r.m.s. deviation 
between equivalent e-carbons of 0.8 A. This demonstrates 

25 that the amino acid sequences of BIAP and thermitase fold 
into three dimensional structures which are extremely 
similar. 

in the third step, the alignment of the three 
dimensional structures is used to identify sites which 

30 affect the stability of the target protein. This can be 
accomplished by a variety of methods. Using a computer 
program designed to display protein structures and surfaces 
such as QUANTA™, the structure of the reference protein can 
be displayed with the probe-accessible surface. The 

35 combined display of the reference protein and probe- 
accessible surface can then be visually examined to 
determine which amino acids in the reference protein fall 



SUBSTITUTE SHEET 



WO 92/21760 



PCT/US92/04306 



29 

outside of the solvent-accessible surface or inside 
internal cavities. An alternative method which can be used 
comprises coloring the atoms of the reference protein by 
determining whether amino acids in the reference protein 
5 fall outside of the solvent-accessible surface or inside 
internal cavities. The probe -accessible surface map (probe 
map) was used to color the atoms in the transformed 
subtilisin BPH' structure. In order to color each atom, an 
energy value needs to be interpolated from the probe map at 

10 each atomic coordinate. 

The probe map consists of three dimensional grid with 
an energy value (E) at each grid point. In the preferred 
embodiment, the probe map is a 50x50x50 A box centered on 
the center of mass of the protein with a 1 A grid unit in 

15 all three dimensions <x, y, and z) . In its optimal 
conception, the size of the probe map is adjusted such that 
all of the protein atoms fall within the probe map's 
bounds. The energy value at each protein atom position was 
approximated by interpolating from the energy values from 

20 the surrounded eight grid points in the probe map. Given 
the energy value at each point from the probe map, the grid 
spacing, and the atomic coordinate, it is a simple matter 
for any one skilled in the art to interpolate an energy 
value at each atomic coordinate. 

25 In one such method, an energy . value of zero is 

assigned arbitrarily if an atom falls outside the bounds of 
the map. Prom a given atomic coordinate (x,y,z), the eight 
closest grid points from the probe map which surround 
(x,y,z) are identified such that (x 1 <x<x 2 ), 

30 (y x < y < y 2 )# »« d < z i < z K 2 2)- The eight grid points are 

• then A (x 2 , y lf z x ) , B (x lf y 1# z 2 ) , C (x lt y 2 , z 2 ), 

/ D (x 1# y 2 , z x ) , E (x 2 , y 1( Zj) , P (x 2 , y lf z 2 ) , 

G <x 2 , y 2 , z 2 ), and H (x 2 , y 2 , Zj). The energy value (E) at 
a given grid point such as (x 1# y x , z a ) is then E{x,,y 1 ,r 1 ) or 

35 eguivalently E A . The energy at a specific atomic coordinate 
E, . can be interpolated from the probe map given the 
eight nearest surrounding grid points (A through H, as 
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described above) and the value at each grid point (B A 
through £«) . The equation vhich was used for calculating 
the energy at specific ato»ic coordinates, E (Jt , y , g) , is shown 
in Equation (2) . The energy value at each coordinate can 
5 then be stored and used to display the molecule. 

(2) 



10 



20 



where 



E 0 



and where 

The protein atoms were colored on the basis of this 
interpolated energy value. The protein was displayed using 
QUANTA™ and atoms with interpolated energies below 10 
Jccal/mol were colored as red. Atoms with interpolated 
energies above 10 kcal/mol were colored green. Visual 
inspection allowed identification of side chains which 
penetrated the solvent accessible surface or penetrated 

internal cavities. 

There are also two computer based methods for 
15 increasing the stability of a protein. The first method 
comprises the steps of: (1) generating a probe-accessible 
surface of said target protein by probing the coordinates 
of said protein with an uncharged probe molecule having a 
radius of about 0.9 to about 2.0 A, wherein said probe- 
accessible surface has an external surface the interior of 
which contains one or more probe-accessible internal 
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cavities; (2) identifying the amino acids which sake up the 
boundaries of the internal cavities, wherein said aaino 
acids comprise a set of sites which when nutated increase 
the stability of the protein; (3) identifying an aaino acid 
5 mutation which would decrease the volume of said internal 
cavities; (4) determining if said amino acid in said target 
protein can be changed without creating unacceptable steric 
interactions; (5) replacing the amino acid in said target 
protein by site-directed mutagenesis of the gene which 

10 expresses said target protein. 

The first two steps of the above first method for 
improving the stability of a protein are the same as those 
disclosed above for the first computer based method for 
identifying the sites which affect the stability of a 

15 protein. 

In step (3) an amino acid identified in step (2) is 
examined with the goal of identifying a mutation which 
would decrease the volume of said internal cavity. The 
size, shape and position of said internal cavity often 

20 defines and limits what mutations are acceptable and 
allowable given the distinct shape and size of each 
individual amino acid side chain. However, as a particular 
site in the protein has been identified for mutation, 
appropriate mutations can be also be determined by applying 

25 any of the various heuristics which define generally 
acceptable mutations (Matthews, B.W. , et al. (1987) Proc. 
Natl. Acad. Sci. 84:6663-6667; Henendez-Arias, L., and 
Argos, P. (1990) J. Hoi. Biol. 206:397-406; Sandberg, W.S., 
and Terwilliger, T.C. (1991) Trends Biotechnol. 9:59-63; 

30 Bordo, D. , and Argos, P. (1991) J. Mol. Biol. 217:721-729). 

in step (4) a determination is then made if the amino 
acid identified for change in the target protein can be 
mutated or changed without creating a conformation of the 
target protein having unacceptable steric interactions. The 

35 separation distance between two atoms considered 
unacceptably short is some percentage of the sum of the van 
der Waal radii of the two atoms in question. Values of 90- 
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95% of the sum of the van der Waal radii are common though 
others could be used. Common atoms between the original 
and replacement amino acid side chain are located and fixed 
in the same position. The new amino acid is rotated to 
5 find the position with the least number of close contacts 
or unacceptable steric interactions (distances shorter than 
physically reasonable) . The separation distance at which 
two atoms are considered unreasonably short is some 
percentage of the sum of the van der Waal radii of the two 

10 atoms in question. Values of 90-95% of the sum of the van 
der Waal radii are common though others could be used. If 
all conformations of the new amino acid have close 
contacts, the amino acid substitution is rejected. A 
conformation with no close contacts which can be matched to 

15 a preferred amino acid conformation as defined by Ponder, 
J.W., and Richards, P.M. (1987) J. Mol. Biol. 193:775-791, 
is most highly desirable. In step (6) the amino acid 
identified for change to the corresponding amino acid in 
the same position in the reference protein is changed by 

20 site-directed mutagenesis of the gene which expresses the 
target protein by the methods disclosed above. 

The second method comprises the steps of: 
(1) generating a probe-accessible surface of said target 
protein by probing the three dimensional coordinates of 

25 said protein with an uncharged probe molecule having a 
radius of about 0.9 to about 2.0 A, wherein said probe- 
accessible surface has an external surface the interior of 
which contains one or more probe -accessible internal 
cavities; (2) aligning said three dimensional coordinates 

30 of said target protein and a reference protein by moving 
the three dimensional coordinates of said reference protein 
into the coordinate frame of said target protein; 
(3) identifying an amino acid in said reference protein 
whose side chain lies outside said solvent-accessible 

35 surface of said protein or inside said internal cavities of 
said target protein; (4) identifying the amino acid in said 
target protein which occupies the equivalent position as 
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said amino acid in said reference protein; (5) determining 
if said anino acid in said target protein can be changed 
without creating unacceptable steric effects; (6) replacing 
the amino acid in said target protein with the 
5 corresponding amino acid in the equivalent position in said 
reference protein by site-directed mutagenesis of the gene 
which expresses said target protein. 

The first three steps of this method are the same as 
steps (1), (2), and (3) of the second method for the second 

10 computer based method for identifying the sites which 
affect the stability of a protein. 

In step (4) the amino acid in the target protein which 
occupies the equivalent position as the amino acid in the 
reference protein is identified. Equivalency is determined 

15 from the primary sequence alignment and three dimensional 
structure alignment described above. Given two protein 
structures, a target and a reference structure, which have 
been aligned, equivalent amino acids are defined as pairs 
of amino acids, one from the target and one from the 

20 reference protein, which may differ in identity but occupy 

close to the same position in the secondary and tertiary 
structure of the two proteins. 

The following examples are meant to illustrate but not 
to limit the invention. 
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Example 1 

identification of Sites in BLIP for MtigtBtiii 

The structure of BLAP was obtained by X-ray 
crystallography and solved to 1.4 A. The atonic coordinates 
5 are shown in Figure 1. Water molecules were removed from 
the structure and the protein coordinates were used to 
generate a probe-accessible surface using a computer 
program QUANTA™ (version 3.0) . This program can be used to 
calculate a probe-interaction map. The coordinates of BLAP 

10 were read into the computer and the following parameters 
were set in order to perform the probe interaction grid 
calculation. A Van der Waal calculation was requested with 
a "proton" probe (radius of 0.9 A) with a charge of 0.0. 
The box dimensions were set to 50 A with a grid sire of 

15 1 A centered on the a-carbon of residue 219. The maximum 
energy was set to 500 and the minimum to -100. This means 
that energy values which exceed 500 will be set to 500. An 
energy value will exceed 500 when the probe is very close 
to an atom in the protein. The calculations were performed 

20 on a Silicon Graphics Inc. (2105 Landings Drive/ Suite 
2105, Mountain view, CA 94043) 4D/220 Powerlris™ 
workstation. QUANTA™ was used to visualize the probe- 
accessible surface. The map was contoured at 50 kcal/mol 
but this value depends on the particular constants in use 

25 and the method used to generate the probe accessible 
surface. The map was displayed simultaneously with the 
structure of BLAP and amino acid side chains which defined 
the boundaries of the internal cavities were identified 
visually. 

one such amino acid was threonine-69 . This side chain 
is completely buried with only 2% of its surface being 
solvent accessible. The hydroxyl group of the side chain 
defined part of the border of two internal cavities. These 
particular cavities are occupied by water molecules 278 on 
one side, and 280 on the other. Mutating this amino acid 
to valine represents a conservative change which increases 
the hydrophobicity of the side chain while having little 



30 



35 
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effect on size and shape. Using computer modeling, it was 
determined that mutating threonine-69 to valine would not 
create any close contacts with other protein atoms or 
significantly perturb the structure if the valine occupies 
5 the same position as the hydroxyl of threonine-69 in the 
wild type protein. An oligonucleotide was synthesized 
which carried a mutation of the codon for threonine-69 to 
valine (T69V) • This oligonucleotide was used to create a 
site directed mutation in the BLAP gene which was subcloned 

10 into a Bacillus vector and expressed in B. subtilis DB104 
(See Examples 4 and 5) . Strains were identified which were 
expressing the mutant protease and several shake flasks 
were prepared to produce the mutant protein (See Example 
5), The mutant protease was purified from the shake flask 

15 media and characterized for surfactant and temperature 
stability (See Examples 7, 10, and 11). 

The mutation T69V resulted in a 340% increase in the 
half-life of the protease at 50°C, from 12 minutes to 41 
minutes (See Table 3). 

20 Example 2 

Identification of Sites in BIAP for mutagenesis 
based on other proteases . 
(A) Comparison to subtilisin Carlsberg. 

The three dimensional coordinates of subtilisin 

25 Carlsberg (1CSE) were obtained from the Brookhaven Protein 
Database (Bernstein, F.C., et al. (1977) J. Mol. Biol. 
112:535-542). The protease structures were aligned using 
the molecular modeling program QUANTA™. The BIAP 
coordinates were held fixed. The o-carbons of residues 1 

30 to 32 of BLAP were matched to residues 1 to 32 of 1CSE, 
respectively; residues 40 to 60 of BLAP to residues 41 to 
61 of 1CSE; residues 80 to 155 of BLAP to residues 82 to 
157 of 1CSE; residues 170 to 269 of BLAP to residues 176 to 
275 of 1CSE. The BLAP structure was held fixed, and the 

35 1CSE structure was rotated and translated such that the 
r.m.s. deviation between the o-carbons of matched residues 
was minimized. The translation vector 
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(-10.68738, 31.28904, -5.32134) and the rotation matrix 
( 0.17406 -0.65535 0.73500 
-0.42119 -0.72422 -0.54599 
0.89011 -0.21454 -0.40209 ) 
5 were applied to the coordinates of ICSE and the transformed 
coordinates were saved (henceforth, the transformed ICSE 
structure). The final r.m.s. deviation between the matched 
229 e-carbon pairs was 0.872 k. 

The probe-accessible surface nap calculated in 

10 Example 1 was used to color the atoms in the transformed 
ICSE structure. The entire sap, which consists of three 
dimensional grid of (x, y, z) coordinates in space and an 
energy value at each position, was read into .computer 
memory along with the protein coordinates (the transformed 

15 ICSE structure) . The energy value at each atom position 
was approximated by interpolating from the energy values of 
the surrounding eight nearest grid points in the probe map. 
The protein atoms were colored on the basis of this 
interpolated energy value. The protein was displayed using 

20 QUANTA™ and atoms were displayed in different colors 
depending on their interpolated energy value. For example, 
if the energy were greater than 400 the atoms were dark 
blue; between 300 and 400, light blue; 200 and 300, green; 
200 to 100 yellow; and between -100 and 100, red. Visual 

25 inspection of such a display allowed identif ication of side 
chains which penetrated the solvent accessible surface or 

internal cavities. 

One such amino acid was methionine-199 (ICSE 
numbering) in subtilisin Carlsberg. The amino acid was 
30 identified by visual inspection of the transformed ICSE 

structure (as described above) . Below, the coordinates of 
residue 199 from the transformed ICSE structure are shown 
in the Brookhaven Protein Data Bank file format along with 
the interpolated energy values. 
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coordinates of Mathionine-199 
from the 1.2 A atruoture of subtiliain Carlsberg. 



ATOM 



_ wri !364 N MET 199 22.392 40.705 32.311 1.0 500.00 

ATOM 1365 CA MET 199 21.675 40.581 '31.054 1.0 500.00 

ATOM " 
ATOM 



1366 C MET 199 22.438 39.677 30.103 1.0 500.00 

BW 1367 O MBT 199 23.689 39.601 30.254 1.0 500.00 

ATOM 1368 CB MBT 199 21.621 41.991 30.511 1.0 500.00 

ATOM 1369 CO MBT 199 20.868 42.994 31.426 1.0 500.00 

10 ATOM 1370 SD MBT 199 19.150 42.631 31.891 1.0 211.58 

ATOM 1371 CE MBT 199 18.273 43.395 30.493 1.0 41.68 

Column 1 Is the record type; column 2 is the atom number; 
column 3 is the atom name; column 4 is the residue name; 

15 column 5 is the residue number; columns 6, 7 4 8 are the 
x, y, z coordinates of the atom, respectively; column 9 is 
the occupancy; column 10 is normally the temperature factor 
but this has been replaced with the interpolated energy 
value. Note that a value of 500 in this column means that 

20 the atom in nearly completely within the van der Waal 
surface of the B1AP molecule. When the probe map was 
calculated (see Example 1) , energy values greater than 500 
were set to 500. As can be seen, atoms 1370 and 1371 have 
significantly lower energy values (column 10). The end of 

25 this methionine residue extends into an internal cavity in 
the BLAP molecule. 

This residue is equivalent in secondary and tertiary 
structure to valine-193 in BLAP. Using computer modeling, 
valine-193 in BLAP was changed to methionine. The x values 

30 for the new methionine side chain in BLAP were taken from 
the subtilisin BPN' structure. In this conformation, the 
new side chain had no close contacts except for the e- 
carbon of the methionine which contacted a crystal lographic 
water in the BLAP structure. 

35 An oligonucleotide was synthesized which mutated the 

codon for valine-193 to methionine (V193M) in the BLAP 
gene. This oligonucleotide was used to create a site 
directed mutation in the BLAP gene which was subcloned into 
a Bacillus vector and expressed in B. subtills DB104 (See 

40 Examples 3, 4, and 5). Strains were identified which were 
expressing the mutant protease and several shake flasks 
were prepared to produce the mutant protein (See Example 
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5, . The mutant protease was purified from the shake flask 
media and characterized for temperature and surfactant 
stability (See Examples 6, 7, 10, and 11). 

The mutation V193M resulted in a 350% increase in the 
5 half-life of the protease at 50-C, from 12 minutes to 42 
minutes (See Table 3) . 
(B) Comparison to Thermitase. 

The three-dimensional coordinates of thermitase (1TEC) 
were obtained from the Brookhaven Protein Database 
10 (Bernstein, F.C., et al. (1977, J. Mol. Biol. 112:535-542). 
The structures of BLAP and 1TEC were aligned using the 
molecular modeling program QUANTA™ by matching eguivalent 
a-carbons as listed below. 

Matched e-carbone between 
BIAP and Thermitase (1TBC, 
Bi&E i3J2£ 

5-20 12-27 

29-41 

43-72 52-81 
7R-227 85-237 
20 ; 3 y 2 2 5 7 6 240-264 

The BLAP structure was held fixed and the 1TEC structure 
was rotated and translated such that the r.m.s. deviation 
between the a-carbons of matched residues was minimized. 
The translation vector (14.92521, 33.43270, 40.92134) and 

the rotation matrix 

( 0.79048 -0.20395 -0.57753 
-0.01688 0.93532 -0.35340 
0.61225 0.28911 0.73591 ) 
were applied to the coordinates of 1TEC and the transformed 
coordinates were saved (henceforth, the transformed 1TEC 
structure) . The final r.m.s. deviation between the matched 
236 a-carbon pairs was 1.384 A. 
35 The probe-accessible surface map was used to color the 

atoms in the transformed 1TEC structure. The entire probe 
map was read into computer memory along with the 
coordinates of the transformed 1TEC structure. The energy 
value at each atomic position was interpolated from the 
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energy values of the eight surrounding grid points in the 
probe nap. The protein was displayed using QUANTA™ and 
atoms were displayed in different colors as a function of 
their interpolated energy value. For example, if the 
5 energy were greater than 400 the atoms were dark blue; 
between 300 and 400, light blue; 200 and 300, green; 200 to 
100 yellow; and between -100 and 100, red. Visual 
inspection of such a display allowed identification of side 
chains which penetrated the solvent accessible surface or 
10 internal cavities. 

One such amino acid was tyrosine-149 (1TEC numbering) 
in thermitase. The amino acid was identified by visual 
inspection of the transformed 1TEC structure. Below, the 
coordinates of residue 149 from the transformed 1TEC 
15 structure are shown in the Brookhaven Protein Data Bank 
file format along with the interpolated energy values. 
Coordinates of Tyrosine-149 
from the 2.0 A structure of Thermitase. 



20 



23.026 47.326 1.0 500.00 

21.668 47.275 1.0 500.00 

21.557 46.165 1.0 500.00 

21.330 46.466 1.0 500.00 

»™ — 20.595 47.169 1.0 500.00 

ATOM 1057 CG ITR 149 19.859 19.183 46.935 1.0 227.30 

25 ATOM 1058 CD1 TYR 149 20. 5 « 18 . 42 7 48.038 1.0 19.11 

18.722 45.608 1.0 275.01 

17.146 47.607 1.0 10.99 

17.425 45.371 1.0 500.00 

16.649 46.488 1.0 131.28 

15.337 46.282 1.0 147.29 



30 



ATOM 


1052 


N 


TYR 


149 


19.783 


ATOM 


1053 


CA 


TYR 


149 


20.372 


ATOM 


1054 


C 


TYR 


149 


21.456 


ATOM 


1055 


0 


TYR 


149 


22.619 


ATOM 


1056 


CB 


TYR 


149 


19.262 


ATOM 


1057 


CG 


TYR 


149 


19.859 


ATOM 


1056 


CD1 


TYR 


149 


20.262 


ATOM 


1059 


C02 


TYR 


149 


20.014 


ATOM 


1060 


CE1 


TYR 


149 


20.762 


ATOM 


1061 


CE2 


TYR 


149 


20.531 


ATOM 


1062 


CZ 


TYR 


149 


20.860 


ATOM 


1063 


OH 


TYR 


149 


21.165 



column 10 is normally the temperature factor but this has 
been replaced with the interpolated energy value. As can 
be seen, the phenyl ring of the tyrosine side chain has 

35 significantly lower energy values (column 10 of atoms CG, 
CD1, C02, CE1, CE2 and CZ) . 

This residue is equivalent in secondary and tertiary 
structure to serine-139 in BIAP. Using computer modeling, 
serine-139 in BIAP was changed to tyrosine. The % values 

40 for the new tyrosine side chain in BIAP were taken from the 
thermitase structure. In this conformation, the new side 
chain had no close contacts that could not be alleviated by 
small changes (less than 5°) of the % values. The modeled 
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tyrosine side chain in BIAP fit. neatly into a crevice on 
tne eurface of the BIAP protein between two surface 

hCUC In' oligonucleotide was synthesized which mutated the 
5 codon for serine-139 to tyrosine (S139Y) in the BIAP gen- 
This oligonucleotide was used to create a site directed 
nutation , in the BLAP gene which was subcloned into a 
Bacillus vector and expressed in B. ^"£J£ 
Examples 3, 4, and 5). Strains were identified 1 which 

10 expressed the mutant protease and several shaJce f lasKs were 
prepared to produce the mutant protein (See Example , 3). 
L mutant protease was purified from the shake flask 
culture and characterized for temperature and surfactant 
stability (See Examples 6, 7, 10, and 11). 

, 5 The mutation S139Y resulted in a 216% increase in the 

half-life of the protease at SO'C, from 12 minutes to 26 
minutes (See Table 3). 

Example 3 

Site Directed Mutagenesis of the BIAP gene 



20 This mutagenesis procedure was first described by 

Stanssens, P., et al. (1989) Nucleic Acids Res. 
17-4441-4454. While this is the preferred method, many 
other methods could be used to introduce oligonucleotide 
site-directed mutations, particularly those which use 
25 single stranded DNA. For example, the method of Kunkel 
(KunRel, T.A. (1985, Proc. Natl. Acad. Sci. USA 82:488-492) 

has also been used. 

A synthetic oligonucleotide was synthesized which 
mutates the codon of threonine-69 to the codon for valine. 

30 The mutagenic oligonucleotide was annealed to a gapped 
duplex DNA which carries the BIAP gene on a segment of 
single stranded (ss) DNA. The gapped duplex (gd) was 
formed by denaturing linear DNA 1 s from P Mcl3C and pMa5-8 
followed by re-annealing. The mutagenic oligonucleotide 

35 annealed to homologous ss BIAP DNA within the gap and the 
remaining gap was filled in by a DNA polymerase and ligated 
using T4 DNA ligase. Subsequent treatment of the product 
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of the fill-in reaction with £xoIII removed any species 
with only partially filled gaps. 

The product of the fill-in reaction was then 
transformed into a repair deficient E. coli strain such as 
5 WK6mutS. Plasmid DNA from the recombinant E. coli WK6nutS 
was prepared and transformed in a low plasmid/recipient 
ratio into a suppressor deficient strain of E. coll such as 
WK6. Ampicillin resistant transformants were selected and 
plasmid DNA of several candidates was purified and checked 

10 for the presence of the mutation. 

The mutant BLAP protease was expressed by transferring 
the mutant BLAP genes from their particular E. coli pMal3C 
derivative vector into a plasmid vector which can replicate 
in Bacillus such as pH70 or pC51. In the following 

15 example, the plasmids pC5l and pH70 can be used 
interchangeably with the exception that plasmid pH70 
encodes resistance to kanamycin while plasmid pC51 encodes 
resistance to tetracycline. The mutant BLAP gene was 
separated from the pMal3C plasmids by digestion with the 

20 restriction endonucleases Aval and SstI and then ligated 
with an Aval-SstI cut fragment of plasmid pH70 that 
includes the regions necessary for kanamycin resistance and 
for replication in Bacillus. The pH70 Aval-SstI fragment 
was purified by high pressure liquid chromatography (HPLC) . 

25 After ligation the mutant BLAP plasmids were transformed 
into B. subtilis DB104, a strain that has been engineered 
to inactivate its own genes encoding the major alkaline and 
neutral proteases. B. subtilis DB104 transformed by these 
plasmids were grown on a nutrient-skim milk agar in the 

30 presence of the antibiotic kanamycin. Clones that 
manufactured mutant protease were identified by the 
formation of clear zones of hydrolysis in the skim milk. 
Plasmid DNA was purified from these clones to verify that 
the protease-producing clones carried the a plasmid-borne 

35 BLAP gene with the desired mutation. The plasmid DNA was 
purified away from cell protein and chromosomal DNA by SDS- 
salt precipitation followed by chromatography over a Qiagen 
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ion-exchange column (Qiagen Corporation). Aval-SstI 
digested plasmid DNAs fro* different clones were compared 
with Aval/Sstl-digested derivatives of plasmid pH70 known 
to carry an intact BLAP gene. Plasmid digests were compared 
by agarose gel electrophoresis to identify plasmids that 
have the proper-sized Aval/SstI DMA fragments. Selected 
plasmid DNAs were then sequenced across the region of the 
particular BLAP mutation to confirm that the mutation was, 
present. One or more clones of each BLAP mutation were 
stored frozen in 15% glycerol at -70»C and also cultivated 
in shake flasks (Examples 4 and 5) to manufacture mutant 
protease for characterization. 

Example 4 
production of Proteases 
Each strain of B. subtilis DB104 that carried a 
plasmid with one of the mutant BLAP genes was cultivated in 
shake flasks to make the mutant protease. Strains were 
grown in 50 ml precultures of (Difco) Luria Broth (LB) with 
the antibiotic kanamycin for pH70 derived clones or 
tetracycline for P C51 derived clones at 37»c and 280 rpm in 
a New Brunswick Series 25 Incubator Shaker. After 7 to 8 
hours of incubation 2.5 or 5.0 ml of the preculture was 
transferred to 50 or 100 ml of MLBSP medium (Table 5), 
respectively, with either 20 nq/l of kanamycin/ or 
25 15 Mg/"»1 of tetracycline in 500 ml (Bellco) baffled shake 

flasks for growth and eventual production of the protease. 
These main shake flask cultures were incubated at 240 rpm 
and 37 »C for 64 hours before the culture broths were 
treated to remove intact cells and cellular debris, and to 
30 reduce the pH to 5.8 before they were concentrated. The 
protease production of each culture was monitored by 
electrophoresis of culture supernatants with reverse 
polarity on 12.5% homogenous polyacrylamide gels with the 
Pharmacia PhastSystem. 

Example 5 

Production of Mutant proteases in Shake F leaks 



35 
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A hot loop was used to streak each mutant strain from 
a frozen cryovial culture onto an LB-skim milk agar 
containing either 20 nq/ml of kanamycin or 15 /*g/ml of 
tetracycline. The plates were incubated at 37 »C for 20 to 
5 24 hours. A single, isolated colony producing a good lone 
of hydrolysis of the skin milk was picked into a 250 ml 
Erlenmeyer flask containing about 50 ml Luria Broth (LB) 
which contained either 20 /*g/ml kanamycin or 15 jig/ml of 
tetracycline. The broth was incubated in a New Brunswick 

10 Series 25 Incubator Shaker at 37«C with shaking at 280 rpm 
for 7 to 8 hours. Either 2.5 ml of the turbid preculture 
was transferred into 50 ml of MLBSP containing either 
20 fig/ml kanamycin or 15 Mg/»1 of tetracycline in each of 
four baffled 500 ml flasks, or 5 ml of preculture was used 

15 as an inoculum for 100 ml of MLBSP broth with antibiotic 
contained in each of two 500 ml baffled flasks (a 5% v/v 
transfer). All flasks were incubated at 240 rpm and 37°C 
for 64 hours. After 64 hours of incubation the set of 
flasks for each culture was consolidated, transferred to 50 

20 ml centrifuge tubes, and centrifuged at 20,000 g aT for 15 
minutes at 4°C. The broth was filtered through Miracloth 
(Calbiochem Corp. #475855) into 400 ml beakers chilled on 
ice. The broth was slowly stirred on ice for 30 minutes 
before the broth pH was reduced to 5.8 by the slow addition 

25 of glacial acetic acid. More fine debris were removed by 
centrif ligation again at 20,000 g av and the broth was 
filtered through Miracloth into graduated cylinders to 
measure the volume. Two sets of 1 ml samples were made for 
PhastSystem gels and activity assays. The broth was stored 

30 on ice until the protease could be purified. The MLBSP 
media used for the production of BLAP in shake flask 
cultures Is described in Table 5. 

35 
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TABLE 5 

COMPOSITION OP KLBSP MEDIUM 



Component 



Quantity (for 1 liter of nedia) 



10 


deionized water 


750 ml 




Difco Casitone 


10 gm 




Oifco Tryptone 


20 gm 


15 


Difco Yeast Extract 


10 gm 




MaCl 


5 gm 

* 


20 


Sodium Succinate 


27 gm 



The *edi. v.. adjusted to pH of 7 oy m» ^——3,5 
^ e ? IS lbs/in*. «T. «" » we. cooled before 

eauSl tne .Uril. stock soXutions described in Appends 1, 
while stirring. 

APPENDIX 1 (additions to MLBSP broth) 

Component 



Quantity 
(for 1 L of media) 



MgS0 4 .7H 2 o 
CaCl 2 .2H 2 0 
FeS0 4 .7H 2 0 
MnCl 2 .4H 2 0 
Glucose 
PIPES Buffer 1 
KPO A Buffer 2 



(100 mg/ml stock, autoclaved) 
(30 mg/ml stock, autoclaved) 
(lmM stock, filter sterilized) 
(lmM stock, autoclaved) 
(25% (w/v) stock, autoclaved) 
(pH 7.2, 1 K stock, autoclaved) 50.0 ml 
(1.5 M stock, autoclaved) 50.0 ml 



1.0 ml 
2.5 ml 
0 .5 ml 
0.5 ml 
80.0 ml 



1 Pioera*ine-N.N*-bi«(2-ethan» sulfonic acid). 

2 A Hmeient Lount of 1.5 K diba.ic phoaphaf (Jf»*) 

200 X S Tf 1.5 K n»noba.ic pho.ph at » iW^/ffiSL? '^ItUl 

Either kanamycin or tetracycline antibiotic stock solutions 
were added to the media just before use to a final 
concentration of 20 n/nl and 15 /.g/ml respectively. 
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Bxaaple C . 

purification of BLXP 
Fermentation broth of transformed B. subtllls DB104, 
while still in the fermenter, was adjusted to pH 5.8 with 
5 4 N H 2 S0 4 . The broth was collected and cooled to 4 # C. If 
not mentioned otherwise, all subsequent steps were 
performed on ice or at 4*C. An aliquot of the broth 
material was clarified by centrifugation at 15,000 x g a ^, 
for 60 min. Floating lipid material was removed by 

10 aspiration, and the supernatant filtered through Miracloth. 
The dark brown solution was placed in dialysis tubing 
(Spectrapor; #1, 6 to 8 kilodalton (kDa) molecular-weight- 
cut-off , 1.7 ml/cm) and dialyzed for 16 hours in 20 mM 2- 
(N-morpholino)ethanesulfonic acid (MES) containing 1 mM 

15 CaCl 2 , adjusted with NaOH to pH 5.8 ( 'MES buffer ■) . The 
dialysate was clarified by centrifugation (20,000 x g av# for 
10 min) and the pH of the solution was adjusted to 7.8 with 
2 N NaOH. The enzyme solution containing approximately 
0.9 g of protein in 1.2 liter was loaded at a flow rate of 

20 150 ml/hour onto a column of S-Sepharose Fast Flow (SSFF, 

Pharmacia; 25 mm diameter, 260 mm long) previously 
equilibrated with 20 mM N-(2-hydroxyethyl)piperazine-N , -(2- 
ethanesulfonic acid) [HEPES], containing 1 mM CaCl 2 , 
adjusted with NaOH to pH 7.8 ( ■HEPES buffer 1 ). After the 

25 application of the enzyme solution the column was washed 

with 2 column volumes (250 ml) of HEPES buffer and then 
developed at a flow rate of 140 ml/hour with a gradient of 
0 to 0.25 M NaCl in 600 ml of HEPES buffer. The gradient 
eluate was fractionated into 5.2-ml aliquots which were 

30 collected into tubes containing 2 ml of 100 mM MES/Na*, 
pH 5.8. The enzyme eluted between 0.12 and 0.15. M- NaCl . 
Fractions containing the enzyme were pooled and protein was 
precipitated with ammonium sulfate at 52% of saturation. 
Solid salt (0.33 g per ml of solution) was added slowly 

35 with stirring over a period of 15 min, and stirring was 
continued for another 15 min. The precipitate was 
collected by centrifugation, the pellet was dissolved in 



SUBSTITUTE SHEET 



WO 92/21760 



PCT/US92/04306 



46 

MES buffer and the protein concentration in the solution 
was adjusted to 5 to 7 mg/ml. Following dialysis for 
16 hours in MES buffer the solution was clarified by 
centrifugation and the pH of the supernatant was adjusted 
5 to 7.2. The protease was purified further by a second 
cation exchange separation on SSFF. All steps of this 
procedure were the same as above except that the pH of the 
HEPES buffer was 7.2 and that the NaCl gradient was from 0 
to 0.25 M in 600 ml of HEPES buffer. Protein in pooled 

10 fractions was precipitated as above with ammonium sulfate 
and the enzyme was stored as ammonium sulfate precipitate 
at -70°C. Prior to use the ammonium sulfate precipitate of 
the enzyme was dissolved in an appropriate ' buffer, 
typically MES buffer, at the desired protein concentration, 

15 and dialyzed overnight in the buffer of choice. 

Example 7 
Purification of BIAP Mutants 
Fermentation broth from shaXe flasks, on average 
180 ml, was collected and clarified by centrifugation at 

20 20,000 x g av . for 15 min. The supernatant was placed, with 
stirring, on ice and after 30 min the pH of the solution 
was adjusted to 5.8 with glacial acetic acid. If not 
mentioned otherwise, all subsequent steps were performed on 
ice or at 4°C. The solution was clarified again by 

25 centrifugation (20,000 x g 1T . for 15 min) and was 
concentrated approximately 4-fold by ultrafiltration 
(Amicon; YM30 membrane) . The dark brown solution was 
placed in dialysis tubing (Spectrapor; #1, 6 to 8 KDa 
molecular-veight-cut-off, 1.7 ml/cm) and dialyzed for 

30 16 hours in 20 mM HEPES /Na + , pH 7.8, containing 1 mM CaCl 2 
(•HEPES buffer'). The dialysate was clarified by 
centrifugation (20,000 x g av . for 10 min) and the pH of the 
solution, if necessary, was adjusted to 7.8 with 2 H NaOH. 
The enzyme solution was loaded at a flow rate of 60 ml/hour 

35 onto a column of SSFF (15 mm diameter, 75 mm long), 
previously equilibrated with HEPES buffer. When all 
colored by-products were eluted, the column was washed with 
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50 ml of HEPES buffer. Then, the enzyme was eluted with 
0.25 M NaCl in HEPES buffer. Fractions of 1.2 ml were 
collected into tubes containing 0.5 al of 100 mM MES/Na 4 , 
pH 5.8. Protein content in fractions was monitored either 
by a UV detector set at 280 nn or by protein assay as 
described below. Pooled fractiohs containing protease 
protein were placed on ice and protein was precipitated 
with a 5 to 8-fold volume excess of acetone at -20*C. The 
protein was allowed to precipitate for 6 min, the mixture 
was centrifuged for 4 min at 6,600 x g av> , the supernatant 
was discarded, the pellet was briefly exposed to vacuum 
(water aspirator) to remove most of the acetone, and the 
pellet was dissolved in 20 mN MES/Na*, pH 5.8, to give an 
approximate protein concentration of 30 »g/ml. Prior to 
any assays, the solution was centrifuged in an Eppendorf 
centrifuge for 3 min at full speed (13,000 x g max# ). 

Example 8 
Protein Determination 
Protein was determined by a modified biuret method 
(Gornall, A.G., et al. (1948) J. Biol. Chem. 177:751-766). 
The protein in a total volume of 500 nl was mixed with 
500 til of biuret reagent and incubated for 10 min at 50°C. 
The solution was briefly chilled and its absorbance was 
measured at 540 nm. Typically, a reagent blank and three 
different protein aliquots in duplicates were measured and 
the recorded optical densities analyzed by linear 
regression. Bovine serum albumin (BSA, crystalline; 
Calbiochem) was used as protein standard. With purified 
BIAP protein the usefulness of BSA as protein standard in 
the biuret assay was confirmed. A BLAP sample was 
exhaustively dialyzed in 1 mM sodium phosphate, pH 5.8, and 
subsequently lyophilized. A sample of the solid material 
was weighed, dissolved in 1 mM sodium phosphate, pH 5.8, 
and used to generate a standard curve for the biuret assay. 
From the actual difference in phosphate content (Black, 
M.J., and Jones, M.E. (1983) Anal. Biochem. 135:233-238) of 
the final protein solution and the nominally l mM sodium 
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phosphate solution used to dissolve the protein, the 
contribution of phosphate to the weight of solid BLAP was 
estimated and used to correct the standard curve. 

Example 9 

5 protease Assays 

Two different protease assays were used. With the HPE 
method protease activity was established at a single 
concentration of casein (prepared according to Hammarsten; 
Merck, #2242) as substrate. In the AAPF-pNA assay initial 
10 rates of succinyl-L-alanyl-L-alanyl-L-prolyl-L-phenylalanyl- 
p-nitroanilide (AAPF-pNA; Bachem) supported catalysis were 
used to determine the kinetic parameters k^, and 

A. HPE Method. 

IS culture supernatants or solutions of purified 

proteases were diluted with chilled buffer (10 mM MES/Na*, 
pH 5.8) to give three different solutions with a protein 
concentration ratio of 1:3:5. The substrate solution 
contained 9.6 mg/ml casein, 24 mM Tris, and 0.4% (w/v) 

20 sodium tripolyphosphate, dissolved in synthetic tap water 
(STW; 0.029% (w/v) CaCl 2 »2H 2 6, 0.014% (w/v) MgCl 2 *6H 2 0, and 
0.021% (w/v) NaHC0 3 in deionized water) adjusted to pH 8.5 
at 50 *C, prepared as follows. With stirring for 10 min, 
6 g of casein was dissolved in 350 ml of STW. To this, 

25 50 ml of 0.3 M Tris in STW was added and stirring was 
continued for another 10 min. This solution was heated to 
70* C, then allowed to cool slowly. At 50 # C, the pH was 
adjusted to 8.5 with 0.1 N NaOH. When the solution reached 
room temperature/ the volume was adjusted to 500 ml with 

30 STW, followed by the addition of 125 ml of 2% (w/v) 
pentasodium tripolyphosphate in STW, pH 8.5 (adjusted with 
3 N HC1) . The protease assay was started by adding 50 /xl of 
protease solution to 750 pi of substrate solution placed in 
a 2.2 ml Eppendorf container preincubated for 10 min at 

35 50 # C. After 15 min, the reaction was terminated by the 
addition of 600 nl of trichloroacetic reagent (0.44 M 
trichloroacetic acid, 0.22 M sodium acetate in 3% (v/v) 
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glacial acetic acid) . The mixture was placed on ice for 
15 min, the precipitated protein removed by centrifugation 
for 8 min (at 13,000 x g mM .) and a 900 /*1 aliquot of the 
supernatant was mixed with 600 pi of 2 N NaOH. The 
5 absorbance at 290 nm of this solution was recorded. Each 

dilution was assayed in duplicates and the data points for 
three different dilutions from one enzyme sample was 
analyzed by linear regression. A slope of 1 in this assay 
corresponds to 80 HPE units in the least diluted sample. 

10 In case of strongly colored culture supernatants with 
measurable quantities of UV absorbing material carried over 
by the diluted protease aliquot into the assay cuvette a 
control curve was constructed whose slope was subtracted 
from the slope of the protease assay before final HPE units 

15 were calculated. 

B. AAPF-pNA Assay 

Protease samples were diluted with 501 (v/v) 
1,2 -propanediol in 100 mM Tris, adjusted with 2 N HC1 to 
pH 8.6 at 25°C ( 'Tris-propanediol buf fer 1 ), in which they 

20 were stable for at least 6 h at room temperature. A stock 
solution of 160 mM AAPF-pNA was prepared in 
dimethylsulfoxide dried with a molecular sieve (Aldrich; 
4 A, 4-8 mesh) for at least 24 h prior to use. Fixed point 
assays were performed at 25°C with 1.6 mM AAPF-pNA in 

25 100 mM Tris/ adjusted with 2 N HC1 to pH 8.6 at 25 # C, in a 

total volume of 1.020 ml. The substrate was added to the 
assay buffer 1 min prior* to the assay initiation and the 
reaction was started by addition of enzyme at a final 
concentration of 20 ng to 1.3 jig of protein per ml (0.75 to 

30 48.5 nM enzyme) depending on specific activity. Release of 

p-nitroanilide was monitored at 410 nm, and a molar 
extinction coefficient of 8,480 M^cm" 1 was used to 
calculate amount and concentration of product formed 
(DelMar, E.G., et al. (1979) Anal. Biochem. 99:316-320). 

35 Kinetic parameters were calculated from a velocity vs. 

substrate concentration plot constructed from initial rates 
measured once each at 12 different AAPF-pNA concentrations 
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ranging from 0.16 to 3.2 «M. Data were fitted to a 
hyperbolic curve and proportionally weighted using the 
program EMZFITTER (Leatherbarrow, R.J . (1987) ENZFITTER, 
Biosoft, Cambridge, UK). A nominal molecular weight of 
5 26.8 kDa was used in all calculations that required the 
interconversibn of protein concentration and molarity of 
protease enzyme. 

Example 10 

Temperature stability of Purified Proteases 

10 stability of protease proteins was evaluated under two 

different conditions: (a) 100 mM glycine/Na\ pH 10 at 60 °C, 
and (b) 100 mM glycine/Na*, pH 11 at 50'C. At t = 0 min, 
the protein was diluted to approximately 0.25 mg/ml into 
incubation buffer maintained at the desired temperature. 

15 periodically, an aliquot was removed from this incubation 
mixture and diluted into Tris-propanediol buffer chilled on 
ice. Residual protease activity was determined by the 
AAPF-pNA assay at a fixed AAPF-pNA concentration (1.6 mM) . 
Stability is expressed as half -life (t 1/2 ) of activity 

20 determined from semi-logarithmic plots of residual activity 
as function of time. Each plot consisted of 6 data points 
with t 1/2 approximately in the center between experimental 
points. 

Example 11 

25 Resistance of Proteases to Sodium Dodeeylsulfate (80S) 

SDS was selected as representative of surfactants in 
general. Resistance of proteases to SDS was evaluated 
under two different conditions: (a) 100 mM Tris adjusted 
with 2 N HC1 to pH 8.6 at 50°C, containing 1% (w/v) SDS, 
30 and (b) 50 mM sodium carbonate, pH 10.5 at 50°C, containing 
1% (w/v) SDS. Protease proteins were incubated at a final 
protein concentration of 0.25 mg/ml. Data were collected 
and evaluated as described above under Example 10. 

Example 12 

35 polyaerylamide 6*1 Electrophoresis 

Purity of protease samples was evaluated on 20% non- 
denaturing PhastSystem gels (Pharmacia) run with reversed 
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polarity. The sane system was used to monitor the protease 
content of crude shake flask and fermentation broths. 
Buffer stripe were prepared as described in Application 
File No. 300 (Pharmacia). 
5 Molecular weight determinations were performed on 20% 

SDS PhastSystem gels/ using the following markers: bovine 
serum albumin, 66 kDa; egg albumin, 45 kDa; glyceraldehyde- 
phosphate dehydrogenase, 36 JcDa; carbonic anhydrase, 
29 kDa; trypsinogen, 24 kDa; trypsin inhibitor, 20.1 kDa; 

10 a-lactalbumin, 14.2 kDa (all from Sigma). Prior to 
SDS-PAGE, a protease sample was denatured with formic acid 
at a final concentration of 30 to 50% (v/v) . Upon dilution 
of formic acid to 15% (v/v) protein was precipitated with 
trichloroacetic acid at a final concentration of 10% (v/v) . 

15 The collected pellet was washed once with water, then 
dissolved in 2% (w/v) SDS and heated for 2 min in a boiling 
waterbath. Gels were stained with Coomassie Brilliant Blue 
R-250 (Kodak) . 

DEPOSIT OF MICROORGANISMS 

20 Living cultures of the following have been accepted 

for deposit under the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the 
purposes of patent procedure by the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville, MD 20852 on 

25 May 8, 1991 (the accession number preceeds each deposit 
description) : ATCC 68614 - Bacillus licheniformis ATCC 
53926 strain which contains a tetracycline-resistance 
plasmid originally derived from Bacillus plasmid pBC16 
which carries the ATCC 53926 alkaline protease-BLAP Clal 

30 fusion gene, whose structural gene has the mutations S3T, 
V4I, A188P, V193M, V199I; ATCC 68615 - E. coli WK6 Which 
carries phasmid pMcl3C, a chloramphenicol-resistant 
derivative of phasmid pMc5-8, that contains the ATCC 53926 
alkaline protease- BLAP Clal fusion gene and a 164 bp Kpnl 

35 fragment carrying the ATCC 53926 alkaline protease gene's 
transcriptional terminator. The genotype of strain WK6 
are Alas-jjcaAB, S2lE# strA , autS: eTnlO/F'ladg, ZAM15, 



SUBSTITUTE SHEET 



WO 92/21760 PCT/US92/04306 

li 

52 

p«*V (Zell, R» / and Fritz, H. -J. (1987, EMBO J. 6:1809- 
1815); ATCC 68616 coU GM33 which carries plasm id 

pCB13C, an ampicillin-resistant derivative of Pharmacia 
plasmid vector P TZ19R (Pharmacia) that contains the ATCC 
5 53926 alkaline protease-Clal fusion gene. The CM33 
strain's genotype is dam.3 <dam-m*thylase minus (Marinus, 
M.G. and Morris, N.R. (1974) J. Mol. Biol. 85:309-322)); 
ATCC 68617 - E. eoli WK6 which carries phasmid pKa5-8, an 
ampicillin-resistant mutagenesis vector described in 

10 Stanssens, P. t al. (1989, Nucleic Acids Research 
17:4441-4454. The genotype of strain WK6 mutations are 
Alas-EroAB, galE, strA, jnutS: :TnlO/F'lacl9, ZAM15, proA B 
(Zell, R. , and Fritz, H. -J. (1987) EMBO J. 6:1809-1815); 
ATCC 68618 - an E. coli WK6 which carries phasmid pMc5-8, 

15 a chloramphenicol-resistant mutagenesis vector described in 

Stanssens, P., et al. (1989, Nucleic Acids Res. 17:4441- 
4454. The genotype of strain WK6 are Alac-proAB, galE, 
StEA, ffiU±S::TnlO/F'lacI<, ZAM15 , ET^B* (Zell, R. , and 
Fritz, H. -J. (1987, EMBO J. 6:1809-1815,. 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-832-2215 

(B) TELEFAX: 215-941-6067 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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54 

i 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: ^ '■ 

(B) CLONE:, S3T, V4I, A188P, V193M, V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Ar 
10 



Ala Gin Thr He Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 

Gly val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 ■ 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 1 - 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 I 55 160 
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Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 l* 70 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 190 

Met Ala Pro Gly Val Asn He Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 * 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, A188P, V193M, V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ala Gin Thr Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
x 5 10 15 
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His Asn Ar, Gly Leu Thr Sly Ser My val Lys v.l Ala v.l Uu Asp 

20 25 
Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 

His val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 



70 



65 

Gly val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

8 5 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
J 115 120 1" 

Pro ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 I 35 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
14 5 150 " 3 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 
180 185 



Met Ala Pro Gly Val Asn He Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 



195 



200 



Ala Ser 
210 



Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 



225 



230 



235 
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i 1, 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPEr amino acid 

(D) TOPOLOGY: unknown , 
(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: V4I, A188P, V193M, V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Gin Ser lie Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 
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Asp Gly at, Gly Ala He Ser ser II. U. Gin Gly L.u «« Trp Ala 

100 105 

Gly Asn Asn Gly Met His val Ala Asn Leu S.r L.u My Ser Pro Ssr 

IIS 

Pro S.r Ala Thr Leu Glu Gin Ala Val Asn ser Ala Thr Ser Arg Gly 
130 135 



V.1 Leu val val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie S« 
145 150 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

Asn Asn Asn Arg Ala Ser Ph. Ser Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 



Met Ala Pro Gly Val Asn lie Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

His 

215 220 



195 200 
Ala ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 



210 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 

225. 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 



245 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

( ii ) MOLECULE TYPE : protein 

(iii) HYPOTHETICAL: NO 
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i. 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: S139Y, A188P, V193M, V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
l 5 10 15 ' 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 no 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Tyr Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 175 
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M. Asn A.n Ar, Ala Ser Phe ser Gin Tyr Gly Pro Gly Leu Asp II. 

180 185 

Met Ala Pro Gly Val A.n lie Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

195 zou 

Ala ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Vel Gin 11, 
225 230 

Ar, Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 * . , 

Tvr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORHATION FOR SEQ ID NO: 5: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: ^ _ 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vil) »W^ao«e» sl3SY _ A188P , V193H , V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 

25 JU 



20 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 ' 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 



Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

< 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Thr Ala Thr Leu Glu Gin Ala Val Asn Tyr Ala Thr Ser Arg Gly 
130 135 140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 



Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 175 



Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 190 



Met Ala Pro Gly Val Asn lie Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 



Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 
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Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

Tvr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: A188P, V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 3° 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 9° 95 
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Asp Gly Arg Gly Ala lie Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Glh Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 ,160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 
180 185 190 

Met Ala Pro Gly Val Asn He Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

1 (A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 

1 (B) CLONE: S3T, A188P, V193H 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ala Gin Thr Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 



20 



Thr Gly He Ser Thr His Pro Asp Leu Asn lie Arg Gly Sly Ala Ser 
3 5 

Phe val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 



85 



Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 



115 "0 



Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

Pro ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

val Leu val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 



145 



150 155 160 



Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 



165 
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65 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 
180 185 190 

Met Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala' Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S157T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr oi y n. s« - His W o as P ^ A.n n. at, ci, «, u. - 

35 

Phe Val Pro Oly «« Pro ser Thr Oln Asp Gly Ash Hi. Oly - 
50 

n ^ Tie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
His Val Ala Gly Thr - lie Ala Aia 75 80 

65 

Gly val .la Pro Ser Ala Olu «. Tyr Ala Val Lys Val Leu Cly Ala 

85 

^ 8l y Ar, «, Ala Xle Ssr ser lie Ala Oln Oly !-« «" 

dy Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Oly Ser Pro Ser 
115 wu 

Pro ser Ala T h r Leu olu Gin Ala Val Asn Ser Ala Thr Ser Ar, Oly 
130 135 

val Leu val val Ala Ala Ser Gly Asn Ser Oly Ala Thr Ser He Ser 
145 150 

^ Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Oly Ala Thr Asp Gin 

.an Asn Asn Ar, Ala Ser Phe ser Oln Tyr Gly Ala Oly Leu Asp 11. 
180 A ° 3 

val Ala Pro Gly Val Asn val Gin Ser Thr Tyr Pro Oly Ser Thr Tyr 
195 ZUU 

m _ yaf iia' Thr Pro His Val Ala Gly Ala 

Ala ser Leu Asn Gly Thr Ser Met Ala Tnr *r ^ 

210 215 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 
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Arq Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
* 245 250 255 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: A188P, V193M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 



Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 



His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 



Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 
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«p Gly Ar, Gly Ala lie Ser Ser II. Ala Gin Gly Leu Glu Trp Ala 

100 1U3 



. His val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

Gly Asn Asn Gly Met His vai Aia 12g 

115 120 

Pro ser Ala Thr Lau Slu Gin Ala val Asn sar Ala Thr ser Ar, Gly 
130 135 

val Leu val Val Ala Ala Ser Gly Asn Ser Gly Ala ser Ser lie Ser 
145 150 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met AU val Gly Ala Thr Asp Gin 

165 

Asn Asn Asn Ar, Ala ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 



180 

Gly val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

195 



Met Ala Pro 

200 205 



225 

240 



Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 



Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 



245 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 



2) INFORMATION FOR SEQ ID NO: 10: 



( 2 ) 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A188P 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

* 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly 'Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 ' 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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it 

70 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala ser Leu Asn Gly Thr Ser Met Ala' Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
' 22 5 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
" 245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, V4I, A188P, V193M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ala Gin Thr He Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
l 5 10 . 15 

• His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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71 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

» 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 



Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 190 



Met Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 
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Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Sef Tttf Asn Leu 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

fvii) IMMEDIATE SOURCE: 
(B) CLONE: V193M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
! 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
3 5 40 *5 

Phe val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 



65 



70 75 80 



Glv Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
3 85 90 95 
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Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu "GluTr^'iCiar 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 ,160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Met Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

$ SEM32 EST- 58 « 

(Vii) IMMEDIATE SOURCE: 
1 (B) CLONE: S104T 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO:13: 

m Tie ser Arg Val Gin Ala Pro Ala Ala 

Ala Gin Ser Val Pro Trp Gly He Ser Arg va ^ 

1 5 

His Asn Ar 9 «, «~ Thr Cly Ser Sly val Lys val Ala Val Leu Asp 
Thr cly »e ser Thr His Pro Asp L.u Asn II. Ar, Gly .01, Ala Ser 

35 

Phe val Pro cly Clu Pro Sar Thr Cln Asp Sly Asn Sly Hi. Sly Tnr 
50 55 

His val Ala sly Thr lie Ala Ala Leu Asn Asn Ser He Oly Val Leu 
65 70 

Cly val Ala Pro Ser Ala Slu Leu Tyr Ala Val Lys Val Leu Sly Ala 

85 

Asp Oly Arg Cly Ala He Ser Tnr 11. Ala Cln Cly Leu «u Trp Ala 

100 1U3 

* nv Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
Gly Asn Asn Gly Met his vai 125 
115 i£V 

Pro ser Ala Thr Leu Clu Sin Ala Val Asn Ser Ala Thr Ser Ar, cly 
130 135 



val Leu val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala G1U Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: T69V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 * 55 60 

His Val Ala Gly Val lie Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 ~ 70 75 . 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asd Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 • 105 HO 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 
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Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO; 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: V4I, A188P, V193M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala Gin Ser He Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 



30 



Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 

4 0 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 

His val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser He Gly Val Leu 

70 75 80 

Gly val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 
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Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp -Ala 



100 



Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Ash Ser Ala Thr Ser Arg Gly 
130 135 140 

i i 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 
180 185 190 

Met Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI -SENSE: NO 1 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A224V 



(xi) SEQUENCE DESCRIPTION: SEQ ID NQ:16: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 , 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 



Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 



His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 



Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 

100 105 . ' no 



Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 



Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 

Gin Ser Thr Tyr Pro Gly 
195 200 205 



Val 



180 « 5 

Ala Pro Gly val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 



. Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Val 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 * 33 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
l 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 ' 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 



Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 



Val Ala Pro Gly Val Asn He Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 



Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: V4I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ala Gin Ser He Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 4° 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 1° 5 11° 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 

145 150 155 160 



i I 



Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 



Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 

225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S3T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ala Gin Thr Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 is 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

» 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 5 5 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 1^5 HO 

-■lv Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
1 115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 l" 70 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg. 
260 265 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

fii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S139Y 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly II. S.r Thr Hi. Pro Asp Leu A.n lie Ar, Gly Gly Ale ,Ser 

35 40 

P h e vel Pro Gly Glu Pro ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 

His vel Al. Gly Thr lie Ale Ale Leu Asn Asn ser He Gly Vel Leu 

Gly vel Ale Pro Ser Al. Glu Leu Tyr Ale Vel Lys Vel Leu Gly Al. 

Asp Gly AT, Gly Al. Il« ser Ser II. Al. Gin Gly Leu Glu Trp Al. 
100 Au;? 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 



115 . 120 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Tyr Ala Thr Ser Arg Gly 
130 135 

val Leu vel Vel Al. Ale ser Gly Asn Ser Gly Ale Ser Ser lie Ser 

145 "0 1 

Tyr Pro Ale Arg Tyr Ale Asn Ale Met Al. v.l Gly Al. Thr Asp Gin 

Asn Asn Asn Ar, Al. Ser Phe Ser Gin Tyr Gly Ale Gly Leu Asp II. 
180 185 

val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 "0 235 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 

260 265 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus OSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N242A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 io 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asd 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 

35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 ' no 
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Gly Asn Asn Gly Met His v.! Ala Asn Leu S.r Leu Gly Ser Pro Ser 
115 120 

Pro ser Ala Thr Leu 01. Gin Ala val Asn ser Ala Thr Ser Aro Gly 

130 135 
Val Leu Val Val Ala Ala Ser Gly Asn ,Ser Gly Ala Ser Ser lie Ser 



145 



150 



Tyr Pro Ala Aro Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 

Asn Asn Asn Aro Ala ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 



195 



Ala ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 

215 



210 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 

Arg Ala His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 



245 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S236T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45. 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leii Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 175 
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Asn Asn Asn Aro Ala ser Phe Ser Gin Tyr Gly Al. Gly Leu Asp' lie 
180 185 

»,1 Ala Pro Gly val Asn Val Gin Ser Thr Tyr Pro Gly ser Thr Tyr 
195 ^ 

Ala ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 

210 . , 215 , " i 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Thr Asn Val Gin lie 
225 230 235 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 



245 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S36A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Ala Gin ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 1° 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 2 5 30 
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Thr Gly lie Ala Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 ' 95 ,, 

ten civ Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 Ho 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tvr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 * 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 

260 265 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : ^ j 

4 (A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
1 (B) CLONE: H243A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 



85 



Asp Gly Arg Gly Ala He Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 



100 



SUBSTITUTE SHEET 



WO 92/21760 PCT/US92/04306 

i, 

93 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

1 I., 

Tvr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 " 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn Ala Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

1 } (A) ORGANISM: Serine ^otease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A101T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Ala Gin ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 

Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 

20 



His Asn Arg 

25 30 



Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 

Phe val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 

50 55 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 



Asp Gly Arg Gly Thr II. Ser Ser lie Al. Sin Gly Leu Glu trp Al. 

100 lo:> 
Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 



135 "0 



Pro ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 

Val Leu 
145 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 



165 



SUBSTITUTE SHEET 



WO 92/21760 95 PCT/US92/04306 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

Ala ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 22 ° 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 

225 230 2 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 



245 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S236A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 

35 <° 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 

50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 

Glv Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

115 120 1*5 

pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 

130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 

145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 

180 I 85 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 

210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ala Asn Val Gin He 

225 230 235 240 

Ara Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: E87R 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Arg Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 HO 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 I 20 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 1©0 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 1 ? 5 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 I 85 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tvr Glv Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: ., 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: N114S 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 



Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 



Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 lio 



Gly Ser Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 



Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 175 
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Xsn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 

val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

Ala ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A47W 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Ala Gin ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Levi Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Trp Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 



His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 



Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 no 



Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly. Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 * 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 " 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE:- protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A120S 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
x 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 * 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 
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Gly Asn Asn Gly Met His Val Ser Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

« 

val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 * 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(iij MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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i. 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease^ 

(B) STRAIN : Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: T56V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 " 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 3 

Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Val Gin Asp Gly Asn Gly His Gly Thr 

60 



50 



55 



His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 no 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
* 115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 "0 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 I 75 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp 'lie 
180 185 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 . 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 22 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 2,40 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A120V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 I 5 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly IU Ser Thr His Fro Asp Leu Asn II. Ar, Gly Gly Al. S.T 



35 40 



Ph. val Pro Gly Glu Pro ser Thr Gin Asp Gly Asn Gly His Gly Thr 

50 55 
His V.l Ala Gly Thr II. Al. Ala L«u Asn Asn Ser II. Gly V.l Lou 

65 70 

Gly val Ala Pro Ser Ala Glu Leu Tyr Al. V.l Lys V.l Leu Gly Al. 
8 5 

Asp Gly AT, Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Al. 
100 ±%JD 

Gly Asn Asn Gly Met His Val Val Asn Leu Ser Leu Gly Ser Pro Ser 



115 I 20 



130 135 



Pro ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 



145 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 * u:> 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

2) INFORMATION FOR SEQ ID NO: 33 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: G205V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 



Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 



His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 



Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 



Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 HO 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

• I., 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 



Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Val Ser Thr Tyr 
195 200 205 



Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ 10 NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S130A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Ala Gin Ser Val, Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
! 5 10 15 

I 1 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser. 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 * 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ala Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 



SUBSTITUTE SHEET 



WO 92/21760 110 PC17US92/04306 

Asn Asn Asn Ar, Ala ser Phe Ser Gin Tyr Cly Ala Gly Leu Asp Ifa 
val Ala Pro Gly v.l Asn v.l Gin s.r Thr Tyr Pro Gly Ser Thr Tyr 



195 



Ala ser Leu Asn Gly Thr ser Met Ala Thr Pro His V.l Al. Gly Ala 

215 " w 



210 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 



225 



230 



Arg Asn His Leu Lys Asn Thr Ala Thr ser Leu Gly Ser Thr Asn Leu 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ It NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S130T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Ala Gin ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 
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Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Glv Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 ■ 95, 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 HO 

Glv Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Thr Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 

145 150 155 160 

Tvr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
1 165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 * 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arq Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A96I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 " 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 * 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly lie 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
" 100 105 110 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 



Tyr Pro Ala Arg Tyr Ala Asn Ala Net Ala Val Gly Ala Thr Asp Gin 

165 170 175 



Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 



Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 



Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 



Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN : Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S104T, S139Y, A224V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 1° 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Thr He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Tyr Ala Thr Ser Arg Gly 
130 H5 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Uu Asp II* 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Val 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp ser Asn Val Gin lie 

225 230 * 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S139A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Aia Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala 2»r 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

< • ' . 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 " 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ala Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Vai Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp. Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S142T 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
! 5 10 15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 



Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 



His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 .70 75 80 



Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 



85 



90 



95 



Asp 



Gly 



Arg 



Gly Ala He 
100 



Ser 



Ser 



He 
105 



Ala 



Gin Gly Leu Glu Trp Ala 
110 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Thr Arg Gly 
130 I 35 140 

val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 I 50 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

* 165 170 x 0 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 I 85 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID N0:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S139T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

I ' . - 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 ' 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 HO 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Thr Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 

180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

i 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: I102W 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
! 5 " 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pto Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 1 95 

Asp Gly Arg Gly Ala Trp Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 



Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 



Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 



Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 



Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 



Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

♦ 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A96N 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 * 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Asn 
85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

i .1, 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 * * 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 * 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: N42F 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Ala Gin ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 

c ■ 10 A3 



His 



Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 

Thr Gly He Ser Thr His Pro Asp Leu Phe lie Arg Gly Gly Ala Ser 
35 4° 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
-» 90 »5 



85 



Asp Gly Arg Gly Ala He Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 



100 



Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
H5 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 I 70 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230. 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S142A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 ,95 , 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ala Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 ~ 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He . 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: H118F 



SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
5 10 15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 



Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 



His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 



Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 



Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 no 



Cxi) 

Ala 
1 
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Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 

val Leu Val Val Ala Ala Ser Gly As,n Ser Gly Ala Ser Ser lie Ser 
145 150 155 160. 
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Gly Asn Asn Gly Met Phe Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

Thr 
Ser 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp' Gin 
J 16 5 170 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 

Tvr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

' (iv) ANTI-SENSE: NO 



SUBSTITUTE SHEET 



WO 92/21760 129 PCT/US92/04306 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N237A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 * 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
. 115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Aro Glv 
130 135 140 9 7 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
M5 150 155 ifio 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr. 
19.5 200 205 

Ala ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Ala Val Gin He 
225 230 235 , 240 

Arc Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
_ 245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N255P 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

» 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 ... 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 * no 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arq Glv 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tvr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Pro Leu 

245 250 * 255 
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Tvr Glv Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 

* 260 .265 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: T141W, N237A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 " 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 



SUBSTITUTE SHEET 



WO 92/21760 PCT/US92/04306 

133 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

115 120 125 

Pro Ser Aia Thr Leu Glu Gin Ala Val Asn Ser Ala Trp Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Ala Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T268V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 
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Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Val Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: K229W 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Glv Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
1 85 90 95 

Asd Glv Arg Gly Ala He Ser Ser Tie Ala Gin Gly Leu Glu Trp Ala 
P Y * 10 0 105 HO 



Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly' Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Trp Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 
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Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly ser^Tnr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease ' 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T141W 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 io is 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 gQ 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 



Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Glv Ala 

85 90 . 
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Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Le'u-Giu Trp 4 Ala 
100 105 HO 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Trp Ser Arg Gly 
130 135 140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 ,160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 



Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 



Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 



Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 



Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

• (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: vildtype 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Levi Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin' lie 
225 230 235 240 

Ara Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
* 245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, V4I, A188P, V193M, V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GCGCAAACAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 
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TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 

360 , 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 



ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 



CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 



CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSN 5483 

(vii) IMMEDIATE SOURCE: 

s 
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(B) CLONE: S3T, A188P, V193H, V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

GCGCAAACAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: V4I, A188P, V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

GCGCAATCAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA ' AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 
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CTTGTCAATG CAGAAGCGGC AACACGC ' 
807 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S139Y, A188P, V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATTATGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 
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ii 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 
'(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYF >THETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S130T, S139Y, A188P, V193M, V199I 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 
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AATTTGAGTT TAGGAAGCCC TTCGCCAACA GCCACACTTG AGCAAGCTGT TAATTATGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: A188P, V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 
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GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, A188P, V193M 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GCGCAAACAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentUS DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S157T 

(Xi) SEQUENCE DESCRIPTION: SEQIDNO:61: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

* 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAC ATCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

720 . ' 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 
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(2) INFORMATION FOR SEQ ID NO: 62: i. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: A188P, V193M 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 
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GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A188P 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 
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ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAb S&t&k&Mie 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentUS DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, V4I, A188P, V193M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GCGCAAACAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT ctggtgtaaa agttgctgtc CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 
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GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: V193M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
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GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG Cf CAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S104T 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCA CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 



TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 



CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 67: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T69V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGGTTATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 
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GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAi- . 5 CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO:68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: V4I, A188P, V193M 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 

GCGCAATCAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 
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TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A224V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 
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GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTASAGG* 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG TTGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO:70: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 
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TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT' 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(Vi) ORIGINAL SOURCE: < 
(B) STRAIN: Bacillus lentue OSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: V4I 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

GCGCAATCAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

60 , 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 72: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN : Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S3T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

GCGCAAACAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 
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CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO , 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S139Y 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATTATGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 
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TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC l 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N242A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 
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GCAATCAGCT CGATTCCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 



CGCGCACATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus OSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S236T 



(xi) SEQUENCE DESCRIPTION: SI^Q ID NO:75: 

GCGCAATCAG TGCCATGGGG AATTAGCCGI GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 
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TTGACAGGTT CTGCTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCcXcHrCATCCAGAC 

.120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTG.CCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 , 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGACAAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(B) STRAIN: Bacillus lentus DSM 5483 «• 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S36A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTGCAAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGA!AT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

fvi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

fvii) IMMEDIATE SOURCE: 
(B) CLONE: H243A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 
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CGCAACGCAC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTgVa^TGG/^GCG<5X- 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid . 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A101T 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

GCGCAATCAG TGCCATGGGG AATTAGCGGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC . TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGCT 
300 

ACAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 • 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGCGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 
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GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S236A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 
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GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT* GCAC'G'TT^CT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

• i. 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGGCAAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: E87R 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
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TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGCGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGCG TCTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE:' 

(B) STRAIN: Bacillus lentus DSM 5483 
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(Vii) IMMEDIATE SOURCE: 
(B) CLONE: N114S 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA GCAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 1 



TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT G AC ATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 



CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



SUBSTITUTE SHEeT 



174 

WO 92/21760 PCT/US92/04306 

(D) TOPOLOGY: linear '' 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A47W 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCTG GAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 
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CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A120S 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 - ^ - .... 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTAGC 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 ' 
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GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: .nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T56V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCGTTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 
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AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A120V 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 
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TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGTT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 86: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
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(Xi) SEQUENCE DESCRIPTION : SEQ ID NO:86: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
,180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGTTtCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) ' 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE : ,S13 OA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87: 

GC6CAATCA6 TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAGCA GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 * 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 
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CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S130T 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAACA GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 
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AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A96I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAATTGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 
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AATTTGAGTT TACCAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT* TOATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S104T, S139Y, A224V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

180 
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GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCA CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATTATGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC ' 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG TTGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO' 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S139A 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 



TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 



GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

• h 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA. TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATGCAGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S142T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 . 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTACAAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600* 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 
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(2) INFORMATION FOR SEQ ID NO: 93: w 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentUS DSM 5483 

(vii) IMMEDIATE i DURCE: 
(B) CLONE: S139T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATACAGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
54 0 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 
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GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: I102W 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCATGGAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 
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ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

r . .: • 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus OSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A96N 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 



WO 92/21760 PCT/US92/04306 

190 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAAACGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
~ 480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N42F 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAXCCGTGSA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTATTTATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

i I. 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSN 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S142A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTGCAAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 99: 



SUBSTITUTE SHEET 



WO 92/21760 193 PCT/US92/04306 

(i) SEQUENCE CHARACTERISTICS: < 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

i 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: H118F 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GTTTGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 
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GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACJiXATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO ' 

(vi) ORIGINAL SOURCE: _ 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N237A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 
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TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCGC TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: N255P 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 
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GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA" ^CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGCCATTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: T141W, N237A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 
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TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

TGGTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCGC TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSN 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T268V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AGTTCGC 
807 

(2) INFORMATION FOR SEQ ID NO: 104: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH : 807 base pairs ., 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus OSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: K229W 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTTGGCAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 
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CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
" 780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 105: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: T141W 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

TGGTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 
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TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA 'CAACAAC^C 
540 



GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus lentus 

(B) STRAIN: DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: wild type 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 ' 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 
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GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGG"TAGACT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG, 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 
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What is claimed is: 

1. A mutant Bacillus lentus DSM 5483 protease derived by 
the replacement of at least one amino acid residue of the 
mature form of the Bacillus lentus DSM 5483 alkaline 

5 protease shown in SEQ ID NO: 52 wherein said one amino acid 
residue which is selected from the group consisting of 
Ser3, Val4, Ser36, Asn42, Ala47, Thr56, Thr69, Glu87, 
Ala96, AlalOl, Ilel02, Serl04, Asnll4, Hisll8, Alal20, 
Serl30, Serl39, Thrl41> Serl42, Serl57, Alal88, Vall93, 
10 Vall99, Gly205, Ala224, Lys229, Ser236, Asn237, Ash242, 
His243, Asn255, Thr2 68 is replaced with the amino acid 
residues listed in Table 2. 

* 

2. A mutant Bacillus lentus DSM 5483 protease of claim 1 
15 wherein the valine residue at position 199 is substituted 

by isoleucine, the valine residue at position 193 is 
substituted by methionine, the alanine residue at position 
188 is substituted by proline, the valine residue at 
position 4 is substituted by isoleucine, and the serine 
20 residue at position 3 is substituted by threonine. 

3. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 199 is substituted 
by isoleucine, the valine residue at position 193 is 

25 substituted by methionine, the alanine residue at position 
188 is substituted by proline, and the serine residue at 
position 3 is substituted by threonine. 

4. A mutant Bacillus lentus DSM 5483 protease of claim 1 
30 wherein the valine residue at position 199 is substituted 

by isoleucine, the valine residue at position 193 is 
substituted by methionine, the alanine residue at position 
188 is substituted by proline, and the valine residue at 
position 4 is substituted by isoleucine. 

35 

5. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 199 is substituted 
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by isoleucine, the valine residue at position 193 is 
substituted by methionine, the alanine residue at position 
188 is substituted by proline, and the serine residue at 
position 139 is substituted by tyrosine. 

6. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the' Valine residue at position 199 is substituted 
by isoleucine, the valine residue at position 193 is 
substituted by methionine, the alanine residue at position 
188 is substituted by proline, the serine residue at 
position 139 is substituted by tyrosine and the serine 
residue at position 130 is substituted by threonine. 

7. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 199 is substituted 
by isoleucine, the valine residue at position 193 is 
substituted by methionine, the alanine residue at position 
188 is substituted by proline. 

8. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 193 is substituted 
by methionine , the alanine residue at position 188 is 
substituted by proline, and the serine residue at position 
3 is substituted by threonine. 

9. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 157 is substituted 
by threonine. 

10. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 193 is substituted 
by methionine, and the alanine residue at position 188 is 
substituted by proline. 

11. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the alanine residue at position 188 is substituted 
by proline. 
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12. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 193 is substituted 
by methionine, the alanine residue at position 188 is 
substituted by proline, the valine residue at position 4 is 

5 substituted by isoleucine and the serine residue at 
position 3 is substituted by threonine. 

13. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 193 is substituted 

10 by methionine. . 

14. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 104 is substituted 
by threonine. 

15 

15. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the threonine residue at position 69 is substituted 
by valine. 

20 16. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 193 is substituted 
by methionine, and the alanine residue at position 188 is 
substituted by proline, and the valine residue at position 
4 is substituted by isoleucine. 

25 

17. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the alanine residue at position 224 is substituted 
by valine. 

30 18. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 199 is substituted 
by isoleucine. 

.19. A mutant Bacillus lentus DSM 5483 protease of claim 1 
35 wherein the valine residue at position 4 is substituted by 
isoleucine. 
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20. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the serine residue at position 3 is substituted by 
threonine. 

5 21. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the serine residue at position 139 is substituted 
by tyrosine. 

22. A mutant Bacillus lentus DSM 5483 protease of claim l 
10 wherein the asparagine residue at position 242 is 

substituted by alanine. 

23. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 236 is substituted 

15 by threonine. 

24. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 36 is substituted by 
alanine. 

20 

25. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the histidine residue at position 243 is 
substituted by alanine. 

25 26. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the alanine residue at position 101 is substituted 
by threonine. 



30 



27. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 236 is substituted 
by alanine. 



28. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the glutamic acid residue at position 87 is 
35 substituted by arginine. 
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29. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the asparagine residue at position 114 is 
substituted by serine. 

5 30. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the alanine residue at position 47 is substituted 
by tryptophan. 

31. A mutant Bacillus lentus DSM 5483 protease of claim 1 
10 wherein the alanine residue at position 120 is substituted 

by serine. 

32. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the threonine residue at position 56 is substituted 

15 by valine. 

33. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the alanine residue at position 120 is substituted 
by valine. 

20 

34. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the glycine residue at position 205 is substituted 
by valine. 

25 35. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 130 is substituted 
by alanine. 



30 



36. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the serine residue at position 130 is substituted 
by threonine. 



37. A mutant Bacillus lentus DSM 5483 protease of claim i 
wherein the alanine residue at position 96 is substituted 
35 by isoleucine. 
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38. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the alanine residue at position 224 is substituted 
by valine, the serine residue at position 104 is 
substituted by threonine, and the serine residue at 

5 position 139 is substituted by tyrosine. 

39. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 139 is substituted 
by alanine. 

10 

40. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 142 is substituted 
by threonine. 

15 41. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 139 is substituted 
by threonine. 

42. A mutant Bacillus lentus DSM 5483 protease of claim 1 
20 wherein the isoleucine residue at position 102 is 

substituted by tryptophan. 

43. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the alanine residue at position 96 is substituted 

25 by asparagine. 

44. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the asparagine residue at position 42 is 
substituted by phenylalanine. 

30 

45. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 142 is substituted 
by alanine. 

35 46. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the histidine residue at position 118 is 
substituted by phenylalanine. 
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47. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the asparagine residue at position 237 is 
substituted by alanine. 

5 48. A mutant Bacillus lentus DSM 5483 prbtease of claim 1 
wherein the asparagine residue at position 255 is 
substituted by proline. 

* i. 

49. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the asparagine residue at position 237 is 
substituted by alanine and the threonine residue at 
position 141 is substituted by tryptophan. 

50. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the threonine residue at position 268 is 
substituted by valine. 

51. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the lysine residue at position 229 is substituted 
by tryptophan. 

52. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the threonine residue at position 141 is 
substituted by tryptophan. 

53. A mutant gene which encodes for a mutant Bacillus 
lentus DSM 5483 protease comprising in the direction of 
transcription a promoter, ribosomal binding site, 
initiation codon and the major portion of the pre region of 
the Bacillus licheniformis ATCC 53926 alkaline protease 
gene operably linked to a portion of the pre region and all 
of the pro and mature regions of the Bacillus lentus DSM 
5483 alkaline protease gene wherein one or more codons of 
said Bacillus lentus DSM 5483 alkaline protease gene are 
altered to produce a mutant gene which encodes for a 
protease derived by the replacement of at least one amino 
acid residue of the mature form of the Bacillus lentus DSM 
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5483 alkaline protease wherein said one amino acid residue 
which is selected from the group consisting of Ser3, Val4, 
Ser36, Asn42, Ala47, Thr56, Thr69, Glu87, Ala96, AlalOl, 
116102, serl04, Asnll4, Hisll8, Alal20, Serl30, Serl39, 
5 Thrl41, Serl42, Serl57, Alal88, Vall93, Vall99, Gly205, 

Ala224, Lys229, Ser236, Asn237, Asn242, His243, Asn255, 
Thr268 is replaced with the amino acid residues listed in 
Table 2. 

10 54. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 199 
is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
residue at position 188 is substituted by proline, the 

15 valine residue at position 4 is substituted by isoleucine, 
and the serine residue at position 3 is substituted by 
threonine. 

55. The mutant gene of claim 53 which encodes for said 
20 mutant protease wherein the valine residue at position 199 

is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
residue at position 188 is substituted by proline, and the 
serine residue at position 3 is substituted by threonine. 

25 

56. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 199 
is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 

30 residue at position 188 is substituted by proline, and the 
valine residue at position 4 is substituted by isoleucine. 

57. The mutant gene of claim 53 which encodes for said 
35 mutant protease wherein the valine residue at position 199 

is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
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residue at position 188 is substituted by proline, and the 
serine residue at position 139 is substituted by tyrosine, 

58 . The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 199 
is substituted by isoleucine,' the valine residue at 
position 193 is substituted by methionine, the alanine 
residue at position 188 is substituted by proline, the 
serine residue at position 139 is substituted by tyrosine 
and the serine residue at position 130 is substituted by 
threonine. 



10 



59. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 199 

15 is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
residue at position 188 is substituted by proline, 

60. The mutant gene of claim 53 which encodes for said 
20 mutant protease wherein the valine residue at position 193 

is substituted by methionine , the alanine residue at 
position 188 is substituted by proline, and the serine 
residue at position 3 is substituted by threonine. 

25 61. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 157 
is substituted by threonine. 

62 . The mutant gene of claim 53 which encodes for said 
30 mutant protease wherein the valine residue at position 193 

is substituted by methionine, and the alanine residue at 
position 188 is substituted by proline. 

63. The mutant gene of claim 53 which encodes for said 
35 mutant protease wherein the alanine residue at position 188 

is substituted by proline. 
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64. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 193 
is substituted by methionine, the alanine residue at 
position 188 is substituted by proline, the valine residue 

5 at position 4 is substituted by isoleucine and the serine 
residue at position 3 is substituted by threonine. 

65. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 193 

10 is substituted by methionine. 

66. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 104 
is substituted by threonine. 

15 

67. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the threonine residue at position 
69 is substituted by valine. 

20 68. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 193 
is substituted by methionine, and the alanine residue at 
position 188 is substituted by proline, and the valine 
residue at position 4 is substituted by isoleucine. 

25 

69. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the alanine residue at position 224 
is substituted by valine. 

30 70. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 199 
is substituted by isoleucine. 

71. The mutant gene of claim 53 which encodes for said 
35 mutant protease wherein the valine residue at position 4 is 
substituted by isoleucine. 
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72. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 3 is 
substituted by threonine. 

73. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 139 
is substituted by tyrosine. 

74. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the asparagine residue at position 
242 is substituted by alanine. 



75. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 236 

15 is substituted by threonine. 

76. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 36 
is substituted by alanine. 

20 

77. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the histidine residue at position 
243 is substituted by alanine. 

25 78. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the alanine residue at position 101 
is substituted by threonine. 

79. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 236 
is substituted by alanine. 



80. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the glutamic acid residue at 
35 position 87 is substituted by arginine. 



SUBSTITUTE SHEET 



WO 92/21760 PCT/US92/04306 

214 

81. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the asparagine residue at position 
114 is substituted by serine. 

5 82. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the alanine residue at position 47 
is substituted by tryptophan. 



10 



83. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the alanine residue at position 120 
is substituted by serine. 

84. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the threonine residue at position 

15 56 is substituted by valine. 

85. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the alanine residue at position 120 
is substituted by valine. 

20 

86. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the glycine residue at position 205 
is substituted by valine. 

25 87. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 130 
is substituted by alanine. 



30 



88. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 130 
as substituted by threonine. 

89. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the alanine residue at position 96 

35 is substituted by isoleucine. 
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99. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the asparagine residue at position 
237 is substituted by alanine. 

5 100. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the asparagine residue at position 
255 is substituted by proline. 

101. The mutant gene of claim 53 which encodes for said 
10 mutant protease wherein the asparagine residue at position 

237 is substituted by alanine and the threonine residue at 
position 141 is substituted by tryptophan. 

102. The mutant gene of claim 53 which encodes for said 
15 mutant protease wherein the threonine residue at position 

268 is substituted by valine. 

103 . The mutant gene of claim 53 which encodes for said 
mutant protease wherein the lysine residue at position 229 

20 is substituted by tryptophan. 

104. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the threonine residue at position 
141 is substituted by tryptophan. 

25 

105. A hybrid plasmid capable of replication in Bacillus 
comprised of a gene which encodes for a mutant Bacillus 
lentus DSM 5483 protease comprising in the direction of 
transcription a promoter, ribosomal binding site, 

30 initiation codon and the major portion of the pre region of 
the Bacillus licheniformis ATCC 53926 alkaline protease 
gene operably linked to a portion of the pre region and all 
of the pro and mature regions of the Bacillus lentus DSM 
5483 alkaline protease gene followed by a 164 bp DNA 

35 fragment containing the transcription terminator from the 
ATCC 53926 alkaline protease gene wherein one or more 
codons of said Bacillus lentus DSM 5483 alkaline protease 
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gene are altered to produce a mutant gene which encodes for 
a protease derived by the replacement of at least one amino 
acid residue of the mature form of the Bacillus lentus DSM 
5483 alkaline protease wherein said one amino acid residue 
which is selected from the group consisting of Ser3, Val4, 
Ser36, Asn42, Ala47, Thr56, Thr69, G1U87, Ala96, AlalOl, 
Ilel02, Serl04, Asnll4, Hislis, Alal20, Serl30, Serl39, 
Thrl41, Serl42, Serl57, Alal88, Vall93, Vall99, Gly205, 
Ala224, Lys229, Ser236, Asn237, Asn242, His243, Asn255, 
Thr268 is replaced with the amino acid residues listed in 
Table 2. 



106. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 

15 residue at position 199 is substituted by isoleucine, the 
valine residue at position 193 is substituted by 
methionine, the alanine residue at position 188 is 
substituted by proline, the valine residue at position 4 is 
substituted by isoleucine, and the serine residue at 

20 position 3 is substituted by threonine. 

107. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
residue at position 199 is substituted by isoleucine, the 
25 valine residue at position 193 is substituted by 
methionine, the alanine residue at position 188 is 
substituted by proline, and the serine residue at position 

3 is substituted by threonine. 

30 108. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
residue at position 199 is substituted by isoleucine, the 
valine residue at position 193 is substituted by 
methionine, the alanine residue at position 188 is 

35 substituted by proline, and the valine residue at position 

4 is substituted by isoleucine. 
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109. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
residue at position 199 is substituted by isoleucine, the 
valine residue at position 193 is substituted by 

5 methionine, the alanine residue at position 188 is 
substituted by proline, and the serine residue at position 
139 is substituted by tyrosine. 

110. The hybrid plasmid of claim 105 wherein said mutant 
10 gene encodes for said mutant protease wherein the valine 

residue at position 199 is substituted by isoleucine, the 
valine residue at position 193 is substituted by 
methionine, the alanine residue at position 188 is 
substituted by proline, the serine residue at position 139 
15 is substituted by tyrosine and the serine residue at 
position 130 is substituted by threonine. 

111. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 

20 residue at position 199 is substituted by isoleucine, the 

valine residue at position 193 is substituted by 

methionine, the alanine residue at position 188 is 
substituted by proline. 

25 112. The hybrid plasmid of claim 105 wherein said mutant 

gene encodes for said mutant protease wherein the valine 
residue at position 193 is substituted by methionine , the 
alanine residue at position 188 is substituted by proline, 
and the serine residue at position 3 is substituted by 

30 threonine. 

113. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 157 is substituted by threonine. 

35 

114. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
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residue at position 193 is substituted by methionine, and 
the alanine residue at position 188 is substituted by 
proline. 

5. 115. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue dt position 188 is substituted by proline. 

116. The hybrid plasmid of claim 105 wherein said mutant 
10 gene encodes for said mutant protease wherein the valine 

residue at position 193 is substituted by methionine, the 
alanine residue at position 188 is substituted by proline, 
the valine residue at position 4 is substituted by 
isoleucine and the serine residue at position 3 is 
15 substituted by threonine. 

117. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
residue at position 193 is substituted by methionine. 

118. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 104 is substituted by threonine. 

119. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the threonine 
residue at position 69 is substituted by valine. 

120. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
residue at position 193 is substituted by methionine, and 
the alanine residue at position 1*8 is substituted by 
proline, and the valine residue at position 4 is 
substituted by isoleucine. 



20 
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121. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue at position 224 is substituted by valine. 

5 122. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
residue at position 199 is substituted by isoleucine. 

123. The hybrid plasmid of claim 105 wherein said mutant 
10 gene encodes for said mutant protease wherein the valine 

residue at position 4 is substituted by isoleucine. 

124. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 

15 residue at position 3 is substituted by threonine. 

125. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 139 is substituted by tyrosine. 

126. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 
asparagine residue at position 242 is substituted by 
alanine. 



20 
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127. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 236 is substituted by threonine. 

128. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 36 is substituted by alanine. 

129. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the histidine 
residue at position 243 is substituted by alanine. 
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130. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue at position 101 is substituted by threonine. 

131. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 236 is substituted by alanine. . 

132. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the glutamic 
acid residue at position 87 is substituted by arginine. 

133. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 
asparagine residue at position 114 is substituted by 
serine. 

134. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue at position 47 is substituted by tryptophan. 

135. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue at position 120 is substituted by serine. 

136. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the threonine 
residue at position 56 is substituted by valine. 

137. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue at position 120 is substituted by valine. 

138. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the glycine 
residue at position 205 is substituted by valine. 
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139. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 130 is substituted by alanine. 

140. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 130 is substituted by threonine. 

141. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue at position 96 is substituted by isoleucine. 

# 

142. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue at position 224 is substituted by valine, the 
serine residue at position 139 is substituted by tyrosine 
and the serine residue at position 104 is substituted by 
threonine. 

143. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 139 is substituted by alanine. 

144. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 142 is substituted by threonine. 

145. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 

30 residue at position 139 is substituted by threonine. 

146. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 
isoleucine residue at position 102 is substituted by 

35 tryptophan. 
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147. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue at position 96 is substituted by asparagine. 

5 148. The hybrid plasmid of claim 105 wherein said mutant 

gene encodes for said mutant protease wherein the 

asparagine residue at position 42 is substituted by 

phenylalanine. ' 

10 149. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 142 is substituted by alanine. 

150. The hybrid plasmid of claim 105 wherein said mutant 
15 gene encodes for said mutant protease wherein the histidine 

residue at position 118 is substituted by phenylalanine. 

151. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 

20 asparagine residue at position 237 is substituted by 
alanine. 

152. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 

25 asparagine residue at position 255 is substituted by 
proline. 

153. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 

30 asparagine residue at position 237 is substituted by 
alanine and the threonine residue at position 141 is 
substituted by tryptophan. 

154. The hybrid plasmid of claim 105 wherein said mutant 
35 gene encodes for said mutant protease wherein the threonine 

residue at position 268 is substituted by valine. 
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155. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the lysine 
residue at position 229 is substituted by tryptophan. 

5 156. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the threonine 
residue at position 141 is substituted by tryptophan. 

157. A computer based method for identifying the sites 
10 which affect the stability of a target protein comprising 
the steps of: (1) generating a probe-accessible surface of 
said target protein by probing the three dimensional 
coordinates of said protein with an uncharged probe 
molecule having a radius of about 0.9 to about 2.0 A, 
15 wherein said probe-accessible surface has an external 
surface the interior of which contains one or more probe- 
accessible internal cavities; (2) identifying the amino 
acids which make up the boundaries of the internal 
cavities, wherein said amino acids comprise a set of sites 
20 which when mutated increase the ^stability of the protein. 

158. The computer based method of claim 157 wherein said 
target protein is Bacillus lentus DSM 5483 alkaline 
protease. 

25 

159. A computer based method for identifying the sites 
which affect the stability of a target protein comprising 
the steps of: (1) generating a probe-accessible surface of 
said target protein by probing the three dimensional 

30 coordinates of said protein with an uncharged probe 
molecule having a radius of about 0.9 to about 2.0 A, 
wherein said probe-accessible surface has an external 
surface the interior of which contains one or more probe- 
accessible internal cavities; (2) aligning said three 

35 dimensional coordinates of said target protein and a 
reference protein by moving the three dimensional 
coordinates of said reference protein into the coordinate 
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frame of said target protein; (3) identifying an amino acid 
in said reference protein whose side chain lies outside 
said solvent-accessible surface of said protein or inside 
said internal cavities of said target protein. 

5 

160. The computer based method of claim 159 wherein said 
target protein is Bacillus lentus DSM 5483 alkaline 
protease. 

10 161. The computer based method of claim 159 wherein said 
reference protein is any protein for which a three 
dimensional structure is available which is homologous to 
the target protein. 

15 162. The computer based method of. claim 159 wherein said 
reference protein is thermitase. 

163. The computer based method of claim 159 wherein said 
reference protein is subtilisin Carlsberg. 

20 

164. The computer based method of claim 159 wherein said 
reference protein is subtilisin BPN*. 

165. The computer based method of claim 159 wherein said 
25 reference protein is proteinase K. 

166. A. computer based method for increasing the stability 
of a protein comprising the steps of: (1) generating a 
probe-accessible surface of said target protein by probing 

30 the coordinates of said protein with an uncharged probe 
molecule having a radius of about 0.9 to about 2.0 A, 
wherein said probe-accessible surface has an external 
surface the interior of which contains one or more probe- 
accessible internal cavities; (2) identifying the amino 

35 acids which make up the boundaries of the internal 
cavities, wherein said amino acids comprise a set of sites 
which when mutated increase the stability of the protein; 
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(3) identifying an amino acid mutation which decreases the 
volume of said internal cavities; (4) determining if said 
amino acid in said target protein can be changed without 
creating unacceptable steric interactions; (5) replacing 
5 the amino acid in said target protein by site directed 
mutagenesis of the gene which expresses said target 
protein. 

167. The computer based method of claim 166 wherein said 
10 target protein is Bacillus lentus DSM 5483 alkaline 

protease. 

168. The computer based method of claim 166 wherein said 
reference protein is any protein for which a three 

15 dimensional structure is available which is homologous to 
the target protein. 

169. The computer based method of claim 166 wherein said 
reference protein is thermitase. 

20 

170. The computer based method of claim 166 wherein said 
reference protein is subtilisin Carlsberg. 

171. The computer based method of claim 166 wherein said 
25 reference protein is subtilisin BPN 1 . 

172. The computer based method of claim 166 wherein said 
reference protein is proteinase K. 

30 173. A computer based method for increasing the stability 
of a protein comprising the steps of: (1) generating a 
probe-accessible surface of said target protein by probing 
the coordinates of said protein with an uncharged probe 
molecule having a radius of about 0.9 to about 2.0 A, 

15 wherein said probe-accessible surface has an external 
surface the interior of which contains one or more probe- 
accessible internal cavities; (2) aligning said three 
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dimensional coordinates of said target protein and a 
reference protein by moving the three dimensional 
coordinates of said reference protein into the coordinate 
frame of said target protein; (3) identifying an amino acid 
5 in said reference protein whose side chain lies outside 
said solvent-accessible surface of said protein or inside 
said internal cavities of said target protein; 
(4) identifying the amino acid in said target protein which 
occupies the equivalent position as said amino acid in said 
10 reference protein; (5) determining if said amino acid in 
said target protein can be changed without creating 
unacceptable steric effects; (6) replacing the aad.no acid 
in said target protein with the corresponding amino acid in 
the equivalent position in said reference protein by site- 
15 directed mutagenesis of the gene which expresses said 
target protein. 

174. The computer based method of claim 173 wherein said 
target protein is Bacillus lentus DSM 5483 alkaline 

20 protease. 

175. The computer based method of claim 173 wherein said 
reference protein is any protein for which a three 
dimensional structure is available which is homologous to 

25 the target protein. 

176. The computer based method of claim 173 wherein said 
reference protein is thermitase. 

30 177. The computer based method of claim 173 wherein said 
reference protein is subtilisin Carlsberg, 

178. The computer based method of claim 173 wherein said 
reference protein is subtilisin BPN». 

)5 

179. The computer based method of claim 173 wherein said 
reference protein is proteinase K. 
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1 CLY C 27.9B5 27.065 7.578 

1 QVt O 26.834 26.692 7.822 

1 OhX n 27.785 25.660 5.657 

1 OL* CA 28.517 26.825 6.143 

2 CLN H 28.745 27.585 8.522 
\ cS CA 28.205 27.868 9.851 
2 oS CB 29.179 27.265 10.835 
2 OLN CO 28.905 27.589 12.287 
2cu CD 29.834 26.805 13.151 
I S3 oS 29.476 25.685 13.540 

1 HB2 31.008 27.317 13.461 

2 CLM C 28.045 29.384 10.049 
I GUI O 28.927 30.159 9.642 
i SBR M 26.940 29.781 10.693 

3 III cS 2*568 31.160 10.999 
3 SBR CB 25.036 31.390 10.712 
III* OG 24.576 30.913 9.455 
3 SBR C 26.815 31.424 12.488 

3 SBR O 26.464 30.580 13.314 

4 VAL It 27.371 32.570 12.897 
4 VAL CA 27.534 32.913 14.309 
4 VAL CB 28.860 33.625 14.552 
I^EcCl 29.008 33.965 16.045 
IvSSb 30.006 32.739 14.035 
4 VAL C 26.397 33.869 14.65S 

4 VAL O 26.344 34.990 14.097 
Jp£ H 25.384 33.471 15.449 

5 PRO CD 25.140 32.114 15-924 
S PRO CA 24.313 34.393 15.856 
t 3S £ 23.404 33.524 16.740 
5 PRO CG 23.629 32.110 36.189 
I »U C 24.823 35.677 16.538 
I p£ O 25.816 35.601 17.282 
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17 HIS CB 36.263 29.667 20.715 
17 RIB M 34.810 29.669 21.066 
17 HIS CD 2 33.823 29.667 20.140 
17 HIS KOI 34.240 29.5S7 22.260 
17 HX8 CB1 32.932 29.701 22.062 
17 HIS NB2 32.694 29.881 20.607 
17 HIS C 38.577 30.109 21.246 

17 HIS 0 39^290 29.114 21.115 

18 ASM N 36.976 31.354 20.903 
18 ASN CA 40.320 31.583 20.379 
18 ASN CB 40.420 32.976 19.792 
18 ASN CO 39.771 33.007 16.426 
18 ASN 001 39.324 34.072 17.991 
16 ASN ND2 39.604 31.952 17.631 
18 ASN C 41.377 31.382 21.454 

18 ASN O 42.545 31.105 21.147 

19 ARG N 41.007 31.481 22.726 
19 ARC CA 41.934 31.108 23.756 
19 ARC CB 41.579 31.808 25.055 
19 ARG CO 41.755 33.269 24.901 
19 AM CO 41.327 33.963 26.212 
19 ARG NI 41.469 35.388 26.008 
19 ARG CX 40.620 36.280 26.485 
19 ARG NH1 40.880 37.535 26.211 
19 ARG NH2 39.567 35.963 27.217 
19 ARG C 41.924 29.600 23.992 

19 ARG O 42.655 29.144 24.864 

20 GLY N 41.166 28.766 23.312 
20 GLY CA 41.105 27.344 23.620 
20 GLY C 40.056 26.959 24.682 

20 GLY O 40.026 25.824 25.187 

21 LEU N 39.130 27.872 25.003 
21 LEU CA 38.098 27.626 26.023 
21 LEU CB 38.012 26.796 26.984 
21 LEU CG 39.321 29.049 27.732 
21 LEU CD1 39.370 30.463 28.219 
21 LEU CD2 39.469 28.017 28.815 
21 LEU C 36.767 27.463 25.284 

21 LEU O 36.254 28.371 24.622 

22 TOR N 36.294 26.227 25.368 
22 THR CA 35.094 25.767 24.713 
22 THR CB 35.488 24.785 23.658 
22 THR OG1 36.139 23.695 24.331 
22 THR CG2 36.341 25.467 22.585 
22 THR C 34.069 25.126 25.622 

22 THR O 33.010 24.745 25.146 

23 GLY N 34.304 24.953 26.918 
23 GLY CA 33.327 24.232 27.761 
23 GLY C 33.680 22.769 27.973 

23 GLY O 32.931 22.033 28.642 

24 SER N 34.808 22.329 27.403 
24 SER CA 35.218 20.939 27.546 
24 SER CB 36.565 20.776 26.874 
24 SER OG 36.819 19.378 26.828 
24 SER C 35.310 20.485 29.016 

24 SER O 35.830 21.218 29.880 

25 GLY N 34.786 19,290 29.245 
25 GLY CA 34.688 18.702 30.571 
25 GLY C 33.657 19.387 31.517 

25 GLY O 33.562 19.016 32.697 

26 VAL N 32.861 20.356 31.079 
26 VAL CA 31.862 20.949 31.956 
26 VAL CB 31.863 22.501 31.794 
26 VAL CGI 30.812 23.111 32.729 
26 V.U CO 2 3:. 281 23.05£ 3V 0.71 
26 VAL C 30.488 20.382 31.604 
26 VAL O 30.089 20.375 30.446 



27 LYS M 29.799 19.815 32.589 

27 LtS CA 26.459 19.291 32.434 

27 LYS CB 28.206 16.148, 33.370 

27 LYS CO, 29.146 17.001 33.191 

27 LYS CD 28.427 15.942 33.969 

27 LYS CI 29.530 15.137 34.529 

27 LYS Ml 29.022 14.047 35.345 

27 LYS C 27.394 20.331 32.719 

27 LYS O 27.368 20.968 33.797 

28 VAL « 26.512 20.472 31.730 

11™ 25435 21 - 471 31.738 

28 VAL CB 25.628 22.534 30.583 

28 VAL COl 24.502 23.560 30.596 

28 VAL CC2 26.989 23.220 30.749 

28 VAL C 24.121 20.739 31.512 

28 VAL O 23.947 20.067 30.475 

29 ALA M 23.203 20.933 32.446 

11 ^ £ 2?- 900 20 - 3 " ".385 

29 ALA CB 21.478 19.832 33.763 

29 ALA C 20.90* 21.382 31.920 

29 ALA O 20.919 22.490 32.454 

30 VAL K 20.038 21.127 30.938 
30 VAL CA 19.069 22.069 30.421 
30 VAL CB 19.123 22.097 28.835 
30 VAL OG1 18.017 22.967 28.267 
30 VAL OG2 20.480 22.654 28.369 

.30 VAL C 17.731 21.519 30.928 

30 VAL 0 17.275 20.467 30.425 

31 LEU N 17.155 22.192 31.928 
31 LEU CA 15.899 21.751 32.514 
31 LEU CB 15.878 22.118 33.997 
31 LEU CG 16,523 21.135 34.997 
31 LEU CD1 18.034 21.230 34.826 
31 LEU CM 16.177 21.487 36.457 
31 LEU C 14.832 22.501 31.724 

31 LEU 0 14.647 23.705 31.887 

32 ASP N 14.163 21.816 30.801 
32 ASP CA 13.254 22.474 29.860 
32 ASP CB 14.173 23.197 28.850 
32 ASP CG 13.567 24.470 28.221 
32 ASP 001 14.128 25.565 28.394 
32 ASP O02 12.549 24.352 27.538 
32 ASP C 12.331 21.405 29.226 

32 ASP O 12.057 20.382 29.870 

33 THR N 11.874 21.602 27.972 
33 THR CA 10.956 20.709 27.245 
33 THR CB 10.237 21.562 26.131 
33 THR OC1 11.275 22.099 25.255 
33 THR CC2 9.394 22.669 26.737 
33 THR C 11.600 19.46S 26.594 

33 THR O 10.948 18.766 25.806 

34 CLY N 12.919 19.306 26.830 
34 GLY CA 13.720 18.216 26.294 
34 GLY C 14.758 18.794 25.334 

34 GLY O 14.875 20.030 25.242 

35 XLS N 15.492 17.921 24.630 
35 ILE CA 16.417 18.299 23.557 
35 ILE CB 17.881 18.366 24.013 
35 ILE CG2 18.614 19.017 22.822 
35 ILE CGI 18.149 19.249 25.273 
35 ILE CD 19.589 19.096 25.859 
35 ILE C 16.257 17.256 22.439 

35 ILE O 16.348 16.042 22.687 

36 SER n 15.873 17.729 21.243 
36 SER CA 15.797 1S.OJG' 2O.099 
36 SER CB 14.885 17.400 19.036 
36 SER OG 13.589 17.293 19.580 
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36 SBR C 17.166 16.572 19.462 

36 SBR O 16.018 17*473 19.331 

37 THR * 17.360 15. 298 19.076 
37 THR CA 16.541 14.930 18,274 
37 THR CB 18.300 13.522 17.755 
37 THR OOl 18.169 12.722 16.926 
37 THR O02 19.401 13.039 16.808 
37 THR C 18.675 15.912 17.089 

37 THR O 17.670 16.153 16.374 

38 BIS M 19.880 16.435 16.637 
38 HZ8 CA 20.021 17.474 15.806 
38 HZ8 CB 19.766 16.668 16.461 
38 HZ8 CO 19.722 20.046 15.486 
38 HI8 CD 2 20.803 20.545 14.801 
38 HIS HD1 18.655 20.775 15.114 
38 HIS CE1 19.051 21.670 14.239 
38 HIS K2 20.346 21.530 14.048 
38 BIS C 21.432 17.344 15.305 

38 HIS O 22.341 17.174 16.118 

39 PRO H 21.740 17.555 14.025 
39 PRO CD 20.795 17.752 12.918 
39 PRO CA 23.135 17.467 13.571 
39 PRO CB 23.084 17.619 12.070 
39 PRO OS 21.744 18.261 11.799 
39 PRO C 24.112 18.457 14.195 

39 PRO O 25.318 18.260 14.162 

40 ASP H 23.645 19.520 14.832 
40 ASP CA 24.583 20.486 15.375 
40 ASP CB 24.216 21.897 14.900 
40 ASP CG 25.453 22.801 14.740 
40 ASP OOl 26.526 22.264 14.551 
40 ASP OD2 25.389 24.037 14.740 
40 ASP C 24.561 20.439 16.874 

40 ASP O 24.916 21.450 17.480 

41 LBU H 24.080 19.327 17.430 
41 LBU CA 24.102 19.142 18.883 
41 LSU CB 22.713 19.260 19.513 
41 LED CG 21.938 20.541 19.465 
41 LSU CD1 20.485 20.249 19.882 
41 LSU CD2 22.642 21.595 20.331 

41 LSD C 24.635 17.780 19.265 
. 41 LBU 0 O 24.417 16.602 18.530 

42 ASK M 25.298 17.707 20.415 
42 ASK CA 25.792 16.443 20.953 
42 ASK CB 27.341 16.452 21.066 
42 ASH CG 27.960 15.195 21.667 
42 ASK OOl 29.168 15.169 21.967 
42 ASK HD2 27.260 14.090 21.803 
42 ASH C 25.176 16.272 22.354 

42 ASK O 25.590 16.890 23.332 

43 ILK H 24.152 15.442 22.457 
43 ILB CA 23.456 15.252 23.736 
43 ILB CB 21.958 15.077 23.423 
43 ILB CG2 21.208 14.865 24.766 
43 ILB CGI 21.451 16.284 22.605 
43 ILB CD 20.150 16.044 21.857 
43 ILB C 24.075 14.023 24.422 

43 ILB O 24.160 12.963 23.781 

44 ARG K 24.520 14.131 25.675 
44 ARG CA 25.246 13.030 26.309 
44 ARG CB 26.332 13.557 27.250 
44 ARG CG 27.060 14.753 26.730 
44 ARG CD 27.731 14.330 25.467 
44 ARG KB 29.007 13.812 25.844 
44 ARG CZ 30.106 14.554 25.653 
44 ARG KK1 31.2/4 14.034 <6.023 
44 ARG KH2 30.099 15.758 25.065 



44 ARC C 24.399 12.068 27.123 

44 AR0 O 24.863 11.030 27.534 

45 CLT K 23.166 12.469 27.392 
45 CLT CA 22.266 11.766 26.306 
45 CLT C 21.220 12.697 26.867 

45 GLY O 21.009 13.624 28.377 

46 CLT K 20.524 12.208 29.871 
46 CLT CA 19.453 12.976 30.489 
46 CLT C 16.430 12.113 31.221 

46 CLT O 18.632 10.912 31.445 

47 ALA H 17.313 12.744 31.556 
47 ALA CA 16.222 12.120 32.291 
47 ALA CB 16.461 12.192 33.779 
47 ALA C 14.953 12.896 31.997 

47 ALA O 15.007 14.081 31.604 

48 SKR K 13.817 12.215 32.075 
48 SBR CA 12.537 12.888 31.947 
48 SXR CB 11.680 12.343 30.801 
46 SBR OG 10.390 12.945 30.642 
48 SBR C 11.760 12.680 33.243 

48 SBR O 11.740 11.558 33.791 

49 PHB H 11.224 13.808 33.696 
49 PHI CA 10.358 13.621 34.885 
49 PHB CB 10.967 14.782 35.924 
49 PHB CG 12.302 14.253 36.403 
49 PHB CD1 13.454 14.844 35.923 
49 PHX CD2 12.383 13.128 37.204 

•49PHSC81 14.676 14.300 36.225 
49 PHB C82 13.616 12.590 37.509 
49 PHB Ct 14.760 13.176 37.006 
49 PHB C 8.915 14.206 34.546 

49 PHB O 8.115 14.601 35.418 

50 VAL H 6.571 14.104 33.248 
50 VAL CA 7.230 14.424 32.796 
50 VAL CB 7.264 15.245 31.450 
50 VAL CGI 5.869 15.427 30.821 
50 VAL CG2 7.766 16.635 31.755 
50 VAL C 6.512 13.085 32.594 

50 VAL O 6.694 12.336 31.695 

51 PRO K 5.443 12.724 33.315 
51 PRO CD 4.826 13.553 34.344 
51 PRO CA 4.805 11.411 33.232 
51 PRO CB 3.632 11.476 34.216 
51 PRO CG 4.118 12.525 35.235 
51 PRO C 4.358 10.971 31.854 

51 PRO O 4.621 9.848 31.454 

52 GLT K 3.693 11.820 31.082 
52 CLT CA 3.269 11.377 29.746 
52 CLT C 4.368 11.323 28.690 

52 CLT O 4.117 10.848 27.575 

53 CLO H 5.606 11.757 28.996 
S3 CLO CA 6.645 11.848 28.005 
53 GLU CB 6.909 13.311 27.676 
53 CLO CG 5.740 13.985 27.008 
53 CLO CD 5.991 15.433 26.597 
53 GLU OBI 7.145 15.826 26.393 
S3 CLO 0S2 5.012 16.167 26.462 
S3 CLO C 7.901 11.202 28.519 

53 CLO O 8.803 11.919 28.919 

54 PRO N 8.059 9.880 28.483 
54 PRO CD 7.103 8.945 27.908 
54 PRO CA 9.245 9.200 29.004 
54 PRO CB 8.817 7.745 28.993 
54 PRO CC 7.964 7.702 27.752 
54 PRO C 10.548 9.487. 28.240 

54 PRO O 11.625 9.<i/* *4.750 

55 SBR N 10.497 10.048 27.015 
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SS 8ER CA. 11.678 10.360 26.197 
SS 8BA CB 11.310 10.444 24.730 
SS BMM 00 12.390 10.759 23.670 
SS SCR C 12.250 11.702 26.559 

55 8XR o 11.469 12.540 27.001 

56 THR N 13.533 11.966 26.265 
56 THR CA 14;084 13.315 26.487 
56 THR CB 15.596 13.250 26.945 
56 THR OG1 16.283 12.433 25.998 
56 THR C02 15.743 12.741 28.390 
56 THR C 13.976 14.192 25.225 

56 THR O 14.370 15.358 25.250 

57 CLH H 13.331 13.623 24.170 
57 GLM CA 13.252 14.317 22.886 
57 CLN CB 12.743 13.375 21.797 
57 CUI CO 13.825 12.370 21.360 
57 GLM CO 15.108 13.013 20.762 
57 GLM OBI 15.091 13.752 19.766 
57 CU» HB2 16.267 12.793 21.390 
57 GUT C 12.314 15.495 23.027 

57 GUI O 11.395 IS. 425 23.858 

58 ASP M 12.S08 16.545 22.256 
SB ASP CA 11.724 17.738 22.4S1 
58 ASP CB 12.619 18.910 22.214 
SB ASP CO 12.036 20.302 22.427 
58 ASP OD1 10.9S0 20.447 23.006 
58 ASP OD2 12.737 21.245 22.032 
58 ASP C 10.499 17.854 21.573 

58 ASP O 10.627 18.076 20.358 

59 GLY N 9.311 17.809 22.191 
59 GLJf CA 6.021 17.992 21.500 
59 GLT C 7.601 19.445 21.318 

59 GLT O 6.527 19.731 20.754 

60 ASN N 8.431 20.374 21.802 
60 ASN CA 8.085 21.787 21.793 
60 ASN CB 8.166 22.340 23.222 
60 ASN CO 7.768 23.804 23.268 
60 ASN OOl 8.565 24.702 23.090 
60 ASN ND2 6.503 24.085 23.545 
60 ASN C 8.971 22.642 20.883 

60 ASN O 8.525 23.378 20.022 

61 GLY N 10.269 22.585 21.093 
61 GLY CA 11.202 23.372 20.337 
61 GLY C 12.035 24.187 21.318 

61 GLY O 13.231 24.429 21.115 

62 HZS N 11.417 24.583 22.439 
62 BIS CA 12.068 25. SIS 23.336 
62 HIS CB 11.034 25.886 24.385 
62 BIS CO 11.450 27.020 25.268 
62 BIS CD2 11.218 28.363 25.048 
62 BIS ND1 11.969 26.858 26.498 
62 HIS CB1 12.011 28.039 27.067 
62 HIS NB2 11.572 28.932 26.189 
62 BIS C 13.371 24.957 23.944 

62 BIS O 14.409 25.642 23.918 

63 GLY N 13.351 23.723 24.453 
63 GLY CA 14.577 23.186 25.039 
63 GLY C 15.709 23.028 24.021 

63 GLY O 16.870 23.232 24.356 

64 THR N 15.375 22.712 22.746 
64 THR CA 16.392 22.485 21.700 
64 THR CB 15.729 21.894 20.395 
64 THR OG1 15.057 20.682 20.709 
64 THR CG2 16.823 21.570 19.338 
64 THR C 17.078 23.790 21.373 

64 THE O 18.267 23.810 

65 BIS N 16.252 24.838 21.308 



65 BZ8 CA 16.749 26.168 20.989 
65 818 CB 15.534 27.012 20.769 
65 HZS CO 15.850 28.409 20.237 
65 HIS C02 15.686 28.794 18.918 
65 BZ8 HOI 16.319 29.457 20.941 
65 HZS Cfl 16,438 30.45S 20.096 
65 HZ8 NB2 16.0S6 30.048 18.887 
i\ III C 17 ' 67 * 26.657 22.118 

65 HZS 0 18.820 27.073 21.904 

66 VAL N 17.220 26.S35 23.376 
!! * 18 ' 084 26.803 24.544 
66 VAL CB 17.3S1 26.378 25.832 
66 VAL 001 18.194 26.482 27.092 
66 VAL 002 16.264 27.335 25.994 
66 VAL C 19.427 26.062 24.566 

66 VAL O 20.494 26.687 24.586 

67 ALA N 19.347 24.730 24.292 
67 ALA CA 20.534 23.878 24.204 
67 ALA CB 20.081 22.462 23.828 
67 ALA C 21.526 24.393 23.140 

67 ALA 0 22.732 24.464 23.385 

68 OLY N 21.028 24.643 21.978 
68 OLY CA 21.890 25.373 20.923 
68 GLY C 22.602 26.682 21.221 

68 OLY O 23.730 26.888 20.726 

f! IS! N 22 ' 009 27.S80 22.020 

69 THR CA 22.727 28.785 22.414 
69 THR CB 21.703 29.733 23.084 
69 THR 001 20.690 29.972 22.076 
69 THR CC2 22.339 31.046 23.576 
69 THR C 23.902 28.431 23.353 

69 THR O 24.986 29.042 23.288 

70 ZLK N 23.686 27.426 24.235 
70 ILE CA 24.771 26.952 25.107 
70 I LB CB 24.305 25.947 26.219 
70 ILB CG2 25.501 25.525 27.092 
70 ILB COl 23.197 26.607 27.065 
70 ILB CD 22.458 2S.687 28.103 
70 ILB C 25.820 26.222 24.285 

70 ILB O 27.014 26.530 24.398 

71 ALA N 25.447 25.251 23.451 
71 ALA CA 26.467 24.349 22.986 
71 ALA CB 26.523 23.129 23.948 
71 ALA C 26.352 23.895 21.578 

71 ALA O 26.869 22.805 21.295 

72 ALA N 25.785 24.709 20.671 
72 ALA CA 25.772 24.252 19.280 
72 ALA CB 25.105 25.252 18.367 
72 ALA C 27.223 24.056 18.832 

72 ALA O 28.112 24.803 19.205 

73 LEU N 27.412 22.934 18.090 
73 LEU CA 28.744 22.458 17.726 
73 LBO CB 28.630 21.030 17.087 
73 LEU CO 27.913 19.969 17.918 
73 LBO CD1 27.805 18.638 17.193 
73 LEO CD2 28.650 19.898 19.221 
73 LEO C 29.465 23.384 16.782 

73 LBO O 28.857 23.968 15.858 

74 ASN N 30.768 23.410 17.002 
74 ASN CA 31.650 24.268 16.196 
74 ASN CB 32.829 24.736 17.002 
74 ASN CC 33.638 25.786 16.240 
74 ASN OD1 33.278 26.358 15.207 
74 ASN ND2 34.798 26.098 16.774 
74 ASN C 32.170 23.435 15.022 

74 ASN 0 33;097 22.o3» J m 

75 ASN N 31.602 23.663 tl'.lll 
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75 ASH CA 31.833 22.805 12.665 
75 ASK CB 30.957 21.533 12.702 
75 ASH 00 29.491 21.798 13.007 
75 ASH 001 28.914 22.891 12.815 
75 ASH H02 28.669 20.780 13.605 
75 ASH C 31.502 23.581 11.415 

75 ASH O 31.582 24.793 11.490 

76 SIR H 31.121 22.947 10.298 
76 SIR CA 30.794 23.635 9.055 
76 SIR CB 31.452 22.852 7.920 
76 SIR 00 32.867 22.956 8.023 
76 SIR C 29.308 23.826 8.771 

76 SIR O 28.913 24.172 7.628 

77 ILK ■ 28.486 23.612 9.815 
77 ILI CA 27.049 23.710 9.658 
77 ILI CB 26.315 22.283 9.597 
77 ILI O02 26.735 21.594 8.269 
77 ILI 001 26.604 21.393 10.803 
77 ILI CO 25.657 20.178 10.887 
77 ILI C 26.407 24.494 10.799 

77 ILI O 26.960 24.700 11.891 

78 GLY H 25.199 24.925 10.501 
78 CLY CA 24.338 25.534 11.486 
78 CLY C 24.874 26.773 12.159 

78 CLY O 25.345 27.713 11.542 

79 VAL R 24.781 26.721 13.475 
79 VAL CA 25.226 27.840 14.293 
79 VAL CB 23.977 28.470 15.058 
79 VAL CGI 23.105 29.130 14.034 
79 VAL 062 23.172 27.468 15.841 
79 VAL C 26.342 27.460 15.258 

79 VAL O 27.035 26.445 IS. 015 

80 LIO K 26.574 28.266 16.310 
80 LEO CA 27.681 28.023 17.216 
80 LEO CB 28.856 28.882 16.777 
80 LEO CC 30.090 28.886 17.612 
80 LEO CD1 30.630 27.510 17.592 
80 LEO CD2 , 31.076 29.900 17.113 
80 LEO C 27.210 28.436 18.614 

80 LEO O 26.667 29.536 18.725 

81 GLY R 27.333 27.597 19.625 
81 CLY CA 26.928 28.085 20.924 
81 GLY C 28.076 28.805 21.662 

81 GLY O 29.253 28.863 21.248 

82 VAL H 27.794 29.222 22.883 
62 VAL CA 28.824 29.876 23.663 
82 VAL CB 28.207 30.550 24.929 
82 VAL CGI 29.266 31.108 25.913 
82 VAL CG2 27.250 31.691 24.395 
82 VAL C 29.915 28.926 24.085 

82 VAL O 31.102 29.295 24.118 

83 ALA H 29.504 27.716 24.494 
83 ALA CA 30.437 26.706 24.970 
83 ALA CB 30.194 26.444 26.456 
83 ALA C 30.270 25.404 24.181 

83 ALA O 29.605 24.459 24.615 

84 PRO H 30.827 25.356 22.956 
84 PRO CD 31.627 26.423 22.334 
84 PRO CA 30.449 24.325 21.985 
84 PRO CB 30.988 24.826 20.658 
84 PRO CG 31.954 25.925 20.928 
84 PRO C 30.900 22.929 22.328 

84 PRO O 30.460 21.987 21.673 

85 SER H 31.795 22.800 23.311 
85 SER CA 32.303 21.525 23.810 
85 SER CB 33.826 21.574 23. 9^4 
8b SER CO 34.358 21.691 22.630 



85 SIR C 31.718 21.142 25.147 

85 SIR O 32.U7 20.128 25.689 

86 ALA H 30.806 21.904 25.738 
86 ALA CA 30.151 21.459 26.967 

86 ALA CB 29.153 22.489 27.391 
It St C l 9 ' 40 * 20 - 12a 26.787 
It St 0 28 - 914 19 » 771 25.689 

87 OW N 29.338 19.367 27.882 
87 OLD CA 28.637 16.102 27.917 
87 OLO CB 29.274 17.235 28.965 
87 OLO CO 30.727 16.977 28.652 

67 OLO CO 31.359 15.911 29.523 
87 OLO Oil 30.638 15.142 30.165 
87CLOOI2 32.580 15.8S0 29.550 

11 SZ C 27,172 28.237 

87 OLO O 26.787 18.788 29.353 
f f "2 ■ ? 26.340 18.241 27.230 

88 LIO CA 24.949 18.654 27.326 
88 LIO CB 24.566. 19.080 2S.910 
88 LIO CO 23.S61' 20.137 25.626 

68 LIO CD1 23.929 21.475 26.321 
It 002 23.521 20.293 24.093 

ffJfS £ 24 ' 042 17 « 570 27.876 

88 LIO O 24.093 16.491 27.282 

89 TYR H 23.223 17.777 28.919 
89 TYR CA 22.249 16.807 29.449 
89 TYR CB 22.538 16.474 30.942 
89 TYR CG 23.828 15.673 31.047 
89 TYR CDl 25.048 16.317 30.920 
89 TYR CI1 26.230 15.627 30.860 
89 TYR CD2 23.797 14.292 31.142 
89 TYR CE2 24.979 13.578 31.070 
89 TYR CZ 26.175 14.250 30.937 
89 TYR OH 27.340 13.513 30.872 
89 TYR C 20.847 17.347 29.318 

89 TYR O 20.561 18.513 29.646 

90 ALA N 20.000 16.511 28.733 
90 ALA CA 18.613 16.880 28.538 
90 ALA CB 17.991 16.206 27.306 
90 ALA C 17.794 16.453 29.749 

90 ALA O 17.565 15.260 29.984 

91 VAL H 17.307 17.405 30.542 
91 VAL CA 16.489 17.070 31.706 
91 VAL CB 17.050 17.737 32.979 
91 VAL CGI 16.278 17.172 34.186 
91 VAL CG2 18.529 17.434 33.152 
91 VAL C 15.086 17.576 31.413 

91 VAL O 14.803 18.789 31.545 

92 LYS H 14.186 16.716 30.935 
92 LYS CA 12.860 17.211 30.608 
92 tYS CB 12.271 16.257 29.604 
92 LYS CG 10.802 16.621 29.273 
92 LYS CD 10.070 15.579 28.398 
92 LYS CE 10.580 15.652 26.970 
92 LYS HZ 9.873 14.730 26.095 
92 LYS C 12.009 17.347 31.892 

92 LYS O 11.719 16.396 32.624 

93 VAL H 11.659 18.596 32.162 
93 VAL CA 10.834 18.966 33.299 
93 VAL CB 11.520 19.956 34.31S 
93 VAL CGI 12.719 19.267 34.948 
93 VAL CC2 11.808 21.301 33.634 

93 VAL C 9.545 19.632 32.844 

11 ^ S 8 * 636 19 « 907 33.627 

94 LED H 9.434 19.988 31.564 
94 LRU CA 8,252 20.323 -1.C22 
94 LEO CB 8.576 22.025 30.524 
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94 LXU CO 9.291 22.983 31.432 
94 UO CD1 9.772 24.166 30.604 
94 LBO CD2 0.380 23.374 32.555 
94 LIO C 7.783 19.781 29.830 

94 UO O 8.605 19.154 29.150 

95 OLY M 6.479 19.754 29.581 
95 GLY CA 5.913 18.985 28.494 
95 GLY C 5.987 19.713 27.150 

95 GLY O 6.394 20.861 27.052 

96 ALA N 5.518 18.995 26.112 
96 ALA CA 5.460 19.485 24.733 
96 ALA Cfi 4.826 18.406 23.824 
96 ALA C 4.659 20.791 24.611 

96 ALA O 4.945 21.657 23.772 

97 ASP H 3.680 20.986 25.506 
97 ASP CA 2.957 22.248 25.636 
97 ASP CB 1.637 22.010 26.330 
97 ASP 00 1.665 21.267 27.665 
97 ASP OD1 2.704 20.782 28.130 
97 ASP 002 0.596 21.183 28.270 
97 ASP C 3.645 23.410 26.351 

97 ASP O 3.058 24.477 26.509 

98 GLY K 4.865 23.232 26.620 
98 GLY CA 5.597 24.264 27.561 
98 GLY C 5.223 24.311 29.038 

98 GLY 0 5.866 24.997 29.828 

99 ARC N 4.226 23.548 29.442 
99 ARC CA 3.746 23.492 30.813 
99 ARG CB 2.274 23.049 30.885 
99 ARG CG 1.275 23.728 29.965 
99 ARG CD 1.373 2S.198 30.169 
99 ARG NX 0.065 25.771 29.978 
99 ARG CZ -0.085 27.070 29.703 
99 ARG NH1 -1.339 27.516 29.555 
99 ARG NH2 0.956 27.923 29.560 
99 ARG C 4.518 22.498 31.672 
99 ARG O 4.851 21.418 31.175 

100 GLY N 4.746 22.767 32.962 
100 GLY CA 5.370 21.790 33.846 
100 GLY C 5.043 22.002 35.327 

100 GLY O 4.933 23.136 35.803 

101 ALA N 4.881 20.881 36.029 
101 ALA CA 4.592 20.897 37.462 
101 ALA CB 4.090 19.544 37.966 
101 ALA C 5.844 21.210 38.278 

101 ALA O 6.945 20.745 37.930 

102 I LB N 5.672 21.920 39.412 
102 I LB CA 6.812 22.262 40.268 
102 I LB CB 6.297 23.134 41.461 
102 I LB CG2 7.414 23.536 42.429 
102 I LB CGI 5.672 24.383 40.856 
102 I LB CO 6.675 25.257 40.045 
102 ZLB C 7.555 21.016 40.763 

102 I LB O 8.790 21.014 40.848 

103 SBR N 6.839 19.922 41.067 
103 SBR CA 7.477 18.691 41.459 
103 SBR CB 6.399 17.659 41.711 
103 SBR OG 5.570 17.479 40.562 
103 SBR C 8.451 18.211 40.361 

103 SBR O 9.575 17.820 40.676 

104 SBR N 8.068 16.299 39.085 
104 SBR CA 8.950 17.948 37.972 
104 SBR CB 8.185 18.077 36.660 
104 SBR OG 7.214 17.048 36.535 
104 SBR C 10.230 18.802 37.697 

104 SBR O 11.330 18.272 37.756 

105 ILE N 10.136 20.124 38 04) 



105 ZLB CA 11.306 20.992 36.055 
105 ZLB CB 10.762 22.425 38.033 
105 ZLB 002 12.002 23.365 36.118 
105 ZLB 001 9.919 22.652 36.794 
105 ZLB CD 9.191 24.036 36.796 
105 ZLB C 12.166 20.703 39.293 

105 ZLB 0 13.406 20.539 39.166 

106 ALA H 11.505 20.494 40.464 
106 ALA CA 12.324 20.165 41.677 
106 ALA CB 11.347 20.164 42.670 
106 ALA C 13.009 18.797 41.505 

106 ALA 0 14.185 10.706 41.872 

107 GZJf II 12.452 17.715 40.904 
107 CUf CA 13.267 16.487 40.797 
107 CUf CB 12.464 15.170 40.501 
107 GLH CG 11.300 14.761 41.453 
107 GUI CD 10.582 13.516 41.085 
107 GLH OBI 9.435 13.412 41.526 
107 GLH KE2 11.040 12.542 40.292 
107 GUI C 14.299 ' 16.625 39.702 

107 GUI O 15.333 15.973 39.804 

108 GLY N 14.050 17.494 38.722 
108 GLY CA 15.068 17.832 37.732 
108 GLY C 16.261 18.376 36.456 

108 GLY O 17.409 17.922 38.169 

109 LEU K 16.086 19.337 39.380 
109 LEU CA 17.203 19.921 40.151 
109 LEO CB 16.703 21.098 40.941 
109 LEU CG 16.358 22.306 40.103 
109 LEU CD1 15.553 23.267 40.958 
109 LEU CD2 17.613 22.976 39.579 
109 LEU C 17.899 18.952 41.088 

109 LEO O 19.137 18.923 41.163 

110 GLO N 17.146 18.078 41.739 
110 GLO CA 17.767 16.997 42.502 
110 CLU CB 16.706 16.208 43.295 
110 GLO CG 16.044 17.043 44.443 
110 GLO CD 16.869 17.518 45.693 
110 GLO OBI 16.284 18.250 46.507 
110 GLO OB2 18.058 17.20S 45.884 
110 GLO C 18.562 16.049 41.616 

110 GLO O 19.674 15.702 42.025 

111 TRP N 18.111 15.691 40.389 
111 TRP CA 18.867 14.850 39.469 
111 TRP CB 18.049 14.586 38.169 
111 TRP CG 18.743 13.709 37.091 
111 TRP CD2 19.617 14.121 36.111 
111 TRP CB2 19.919 12.914 35.467 
111 TRP CB3 20.195 15.302 35.658 
111 TRP CD1 18.535 12.343 37.029 
111 TRP NE1 19.264 11.895 36.042 
111 TRP CZ2 20.803 12.903 34.389 
111 TRP CZ3 21.073 15.292 34.585 
111 TRP CH2 21.370 14.099 33.959 
111 TRP C 20.160 15.S63 39.124 

111 TRP O 21.198 14.910 39.072 

112 ALA N 20.134 16.881 38.876 
112 ALA CA 21.331 17.620 38.528 
112 ALA CB 21.029 19.102 38.310 

^ £ "- 411 17 - 530 39 -"2 

112 ALA 0 23.578 17.183 39.356 

113 GLY N 22.019 17.742 40.859 
113 GLY CA 22.962 17.686 . 41.947 
113 GLY C 23.404 16.258 42.205 

113 GLY 0 24.567 16.052 42.565 

114 ASH H 22.524 15.285 42.009 
114 ASH CA 22.901 13 872 '* ' " 
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114 ASN CB 21.735 12.856 42.176 

114 ASN CO 20.764 12.994 43.318 

114 ASH 001 21.095 13.531 44.373 

114 ASN HD2 19.511 12.575 43.163 

114 ASN C 23.820 13.339 41.111 

114 ASN 0 24.532 12.346 41.311 

115 ASN N 23.767 13.953 39.923 
115 ASN CA 24.558 13.494 38.817 
115 ASN CB 23.678 13.382 37.576 
115 ASN 00 22.871 12.090 37.637 
115 ASN 001 23.296 11.044 37.144 
115 ASN ND2 21.716 12.088 38.291 
115 ASN C 25.761 14.354 36.510 

115 ASN O 26.352 14.277 37.428 

116 OLY N 26.126 15.225 39.431 
116 CLY CA 27.354 15.971 39.331 
116 OLY C 27.372 16.991 38.204 

116 OLY O 28.450 17.247 37.614 

117 NET N 26.235 17.614 37.909 
117 NET CA 26.210 18.667 36.878 
117 MKT CB 24.807 19.105 36.509 
117 MET CO 23.929 18.029 35.895 
117 XH SO 24.529 17.426 34.290 
117 KET CB 24.874 15.741 34.705 
117 KET C 26.888 19.893 37.466 

117 MKT 0 26.805 20.170 38.688 

118 BIS N 27.549 20.672 36.615 
118 HIS CA 28.186 21.879 37.094 
118 HIS CB 29.481 22.174 36.318 
118 HIS CO 30.504 21.026 36.418 
118 BIS CD 2 30.795 20.176 35.397 
118 HIS ND1 31.283 20.653 37.437 
118 BIS CE1 32.020 19.622 37.044 
118 HIS HZ 2 31.715 19.339 35.797 
118 HIS C 27.256 23.067 36.967 

118 HIS O 27.293 23.969 37.781 

119 VAL N 26.349 23.070 35.989 
119 VAL CA 25.540 24.246 35.723 
119 VAL CB 26.124 25.082 34.533 
119 VAL CGI 25.194 26.267 34.244 
119 VAL CG2 27.537 25.612 34.864 
119 VAL C 24.194 23.670 35.344 

119 VAL O 24.123 22.627 34.674 

120 ALA N 23.150 24.305 35.817 
120 ALA CA 21.801 23.917 35.457 
120 ALA CB 21.074 23.434 36.689 
120 ALA C 21.128 25.170 34.893 

120 ALA 0 21.156 26.255 35.503 

121 ASN N 20.621 25.061 33.673 
121 ASN CA 19.917 26.133 32.994 
121 ASN CB 20.330 26.144 31.516 
121 ASN CO 19.771 27.346 30.778 
121 ASN OD1 20.464 26.304 30.514 
121 ASN ND2 18.511 27.315 30.418 
121 ASN C 18.399 25.942 33.133 

121 ASN O 17.793 24.936 32.715 

122 LEU N 17.740 26.917 33.768 
122 LEU CA 16.277 26.942 33.962 
122 LEU CB 15.895 27.041 35.454 
122 LEU CO 16.010 25.856 36.340 
122 LEU C01 15.879 26.350 37.770 
122 LEU CD 2 14.914 24.875 36.066 
122 LEU C 15.706 28.182 33.264 

122 LEU O 15.616 29.298 33.808 

123 SER N 15.297 28.013 32.012 
123 SER CA 14.756 29.116 31.232 
123 SER CB 15.164 28.969 29.746 



123 SIR 00 16.514 29.408 29.479 
123 SIR C 13.240 29.029 31.383 

123 SKR O 12.521 28.751 30.400 

124 LIU N 12.816 29.236 32.647 
124 UU CA 11.426 29.119 33.059 
124 LEO CB 11.093 27.646 33.233 
124 LEU CO 12.008 26.810 34.115 
124 LEU CD1 11.540 26.904 35.610 
124 LIU CD2 11.993 25.356 33.606 
124 LEU C 11.200 29.697 34.347 

124 LEU O 12.165 30.261 35.045 

125 OLY N 9.951 30.177 34.709 
125 OLY CA 9.733 31.019 35.884 
125 GLY C 8.243 31.204 36.140 

125 OLY O 7.396 31.003 35.252 

126 SER N 7.991 31.643 37.370 
126 SER CA 6.640 31.772 37.688 
126 SER CB 6.331 30.503 38.752 
126 SER 00 5.242 30.673 39.682 
126 SEA C 6.623 33.055 38.707 

126 SER 0 7.650 33.353 39.302 

127 PRO N 5.544 33.844 36.839 
127 PRO CD 4.300 33.663 38.086 
127 PRO CA 5.458 35.005 39.740 
127 PRO CB 4.310 35.813 39.157 
127 PRO CO 3.377 34.706 38.715 
127 PRO C 5.258 34.663 41.234 

127 PRO O 5.342 35.518 42.119 

128 SER N 4.904 33.408 41.511 
128 SER CA 4.673 32.939 42.860 
128 SER CB 3.340 32.142 42.821 
128 SER OG 2.292 33.013 42.389 
128 SER C 5.845 32.100 43.399 

128 SER 0 6.430 31.293 42.646 

129 PRO N 6.223 32.275 44.678 
129 PRO CD 5.713 33.322 45.595 
129 PRO CA 7.185 31.419 45.363 
129 PRO CB 7.492 32.187 46.641 
129 PRO CO 6.138 32.757 46.937 
129 PRO C 6.639 29.999 45.605 

129 PRO O 5.416 29.779 45.693 

130 SER N 7.567 29.069 45.789 
130 SER CA 7.242 27.724 46.139 
130 SER CB 7.197 26.894 44.688 
130 SER OG 7.387 25.528 45.215 
130 SER C 8.260 27.146 47.092 

130 SER O 9.462 27.127 46.751 

131 ALA N 7.759 26.596 48.220 
131 ALA CA 8.619 25.896 49.154 
131 ALA CB 7.818 25.334 50.312 
131 ALA C 9.445 24.755 46.557 

131 ALA O 10.670 24.654 48.755 

132 THR N 8.761 23.973 47.716 
132 THR CA 9.373 22.810 47.044 
132 THR CB 8.274 22.155 46.232 
132 THR OG1 7.351 21.804 47.256 
132 THR CG2 8.667 20.937 45.371 
132 THR C 10.547 23.223 46.156 

132 THR 0 11.674 22.711 46.213 

133 LEU N 10.257 24.266 45.394 
133 LEU CA 11.185 24.742 44.396 
133 LEO CB 10.467 25.753 43.511 
133 LEU CO 11.231 26.287 42.326 
133 LEU CD1 11.504 25.174 41.324 
133 LEU CT2 10.395 27.377 41.661 
133 LEU C 12.393 2b. 365 45.08l" 
133 LEU O 13.539 25.053 44.693 
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134 OLO N 12.164 26.195 49.111 
134 OLO CA 13.276 26.827 46.768 
134 OLD CB 12.749 27.706 47.793 
134 OLO 00 13.79S 26.476 48.64S 
134 OLD CO 13.249 29.330 49.814 
134 OLO Oil 14.013 30.014 50.482 
134 OLO 012 12.046 29.337 50.036 
134 OLO C 14.181 25.79$ 47.420 

134 OLO 0 15.396 25.91S 47.353 

135 OUt M 13.598 24.770 48.060 
135 OLN CA 14.373 23.701 48.651 
135 OLN CB 13.350 22.830 49.331 
135 OLN CO 13.897 21.596 50.006 
135 OLN CD 12.823 20.790 50.764 
135 OLN Oil 11.779 20.305 50.258 
135 OLN NZ2 13.143 20.692 52.060 
135 OLN C 15.248 22.952 47.620 

135 OLN O 16.434 22.651 47.868 

136 ALA N 14.690 22.749 46.420 
136 ALA CA 15.406 22.071 45.337 
136 ALA CB 14.430 21.762 44.225 
136 ALA C 16.556 22.950 44.802 

136 ALA O 17.676 22.465 44.S13 

137 VAL N 16.313 24.272 44.677 
137 VAL CA 17.37S 25.224 44.305 
137 VAL CB 16.834 26.694 44.238 
137 VAL C01 17.998 27.738 44.134 
137 VAL C02 15.876 26.776 43.047 
137 VAL C 18.531 25.152 45.317 

137 VAL O 19.711 24.982 44.974 

138 ASN N 18.136 25.179 46.588 
138 ASN CA 19.136 25.146 47.616 
138 ASN CB 18.498 25.457 48.973 
138 ASN CC 18.125 26.934 49.063 
138 ASN 001 18.598 27.789 48.320 
138 ASN ND2 17.258 27.299 49.985 
138 ASN C 19.869 23.832 47.685 

138 ASN O 21.103 23.849 47.846 

139 SER N 19.209 22.709 47.506 
139 SER CA 19.937 21.466 47.610 
139 SER CB 19.001 20.303 47.649 
139 SER. OG 18.203 20.407 46.479 
139 SER C 20.860 21.316 46.403 

139 SIR O 22.027 20.902 46.586 

140 ALA N 20.431 21.663 4S.160 
140 ALA CA 21.392 21.545 44.053 
140 ALA CB 20.755 21.895 42.723 
140 ALA C 22.593 22.460 44.264 

140 ALA 0 23.740 22.070 44.057 

141 THR N 22.377 23.682 44.756 
141 THR CA 23.473 24.599 45.081 
141 THR CB 22.851 25.918 45.S87 
141 THR 0C1 22.034 26.472 44.549 
141 THR CC2 23.908 26.914 45.924 
141 THR C 24.419 23.994 46.121 

141 THR 0 25.644 24.054 45.907 

142 SER N 23.975 23.363 47.202 
142 SBR CA 24.937 22.839 48.134 
142 SER CB 24.216 22.599 49.442 
142 SER 00 23.086 21.786 49.207 
142 SER C 25.620 21.592 47.583 

142 SER 0 26.616 21.131 48.150 

143 ARC N 25.155 21.02S 46.447 
143 ARC CA 25.865 19.94S 45.761 

-V CB 24.848 18.9U7 4b,i:61 
143 ARG CC 24.269 18.107 46.467 
143 ARC CD 23.132 17.127 46.152 



}il MS? n 22 .030 17.901 45.589 

"2*52 CX 21 * M7 17.422 44.816 

111 mi 20,140 10 « 201 ««.355 

143ARONH2 20.887 16.134 44.524 

Ai 5 I 6 - 145 20 - 456 4 «-«o2 
}JJ2fS 2 U'W "- 708 43 - 740 

iiiSP- " 27.007 21.760 44.635 

144 « 27.925 22.456 43.737 

144 OUT C 27.365 22.887 42.396 
\il S? 2 22.324 41.531 

*.? 2 6,048 22.782 42.186 

145 VAL CA 25.46S 23.150 40.874 
145 VAL CB 24.118 22.435 40.672 

148 5£ 2! "I" 22.778 M.IJl 

145 VAL 002 24.324 20.921 40.792 

145 VAL C 25.262 24.660 40 82? 

145 VAL O 24.836 25.262 41.640 

146 LEO N 25.677 25.350 39.742 

III -52 2 li'lll 287S8 S.w 
l£Hn ™ 2!^5 l 27 - SX8 38 - 740 

146 LEO CC 26.005 28.987 38.374 
146L80CD1 25.819 29 816 39 604 

til 0 23,978 26.800 38.875 

146 LEO O 23.873 26.371 37.710 

147 VAL N 22.940 27.297 39.523 
147 VAL CA 21.611 27.371 38 926 
147 VAL CB 20.552 27.093 40.011 
147 VAL CGI 19.153 27.272 39.387 
147VALCG2 20.649 25.642 40.526 
147 VAL C 21.405 28.740 38.305 

147 VAL O 21.480 29.768 38.96S 

148 VAL N 21.138 28.776 37.003 
148 VAL CA 21.007 30.019 36.2S1 
148 VAL CB 21.982 30.003 3S. 055 
148 VAL CGI 21.916 31.349 34.328 
148 VAL CC2 23.403 29.791 35.562 

148 VAL C 19.557 30.040 35.781 

l*Am 0 19,127 28.064 35.128 

149 ALA N 18.826 31.120 36.019 

tllS* 17,387 31 ' 187 35.758 

149 ALA CB 16.610 31.028 37.063 

149 ALA C 16.952 32.515 3S.U1 

0 17-539 33.555 35.396 

150 ALA N IS. 931 32.454 34.249 
150 ALA CA 1S.37S 33.605 33.549 

CB 14,427 33 ' 109 ".448 

"0*LA C 14.588 34.556 34.469 

150 ALA 0 13.789 34.092 35.290 

151 SER N 14.717 35.878 34.313 

^Jf!2 13,991 3 «'« 41 35.145 

151 SER CB 14.526 38.284 34.979 

151 SBR OG 14.430 38.730 33.630 

^ C 12 ' 48 5 36.873 34.867 

151 SER o 11.692 37.218 35.761 

152 CLY N 12.062 36.534 33.633 
152 CLT CA 10.646 36.425 33.269 

iSS « J?- 382 37,457 32,19 3 

152 GLT O 11.117 38.447 32.024 

153 ASN N 9.271 37.263 31.499 
153 ASN CA 8.969 38.082 30.352 
153 ASN CB 8.689 37.237 29.116 
153 ASH CG 9.865 36.443 28 658 
153 ASN 001 11.041 36.707 28.880 

53ASNND2 9.501 35.336 ,,9"! 
153 ASN C 7.7S9 38.990 30 526 
153 ASN O 7.190 39.421 29 524 



SUBSTITUTE SHEET 



WO 92/21760 



PCT/US92/04306 



9/18 
FIGURE 1 



154 SIR N 7.390 39.398 31.739 
154 8ER CA 6.193 40.206 31.915 
154 8ER CB 5.577 39.973 33.384 
154 SBR OC 6.365 40.558 34.319 
154 SCR C 6.534 41.682 31.798 

154 SSR O 5.599 42.468 31.793 

155 OtY N 7.805 42.092 31.773 
155 CLY CA 8.154 ,43.499 31.759 
155 CLY C 6.028 44.150 33.143 

155 CLY O 8.292 45.349 33.278 

156 ALA N 7.640 43.439 34.19S 
156 ALA CA 7.476 44.065 35.498 
156 ALA CB 6.649 43.170 36.405 
156 ALA C 8.814 44.359 36.187 

156 ALA O 9.864 43.754 35.891 

157 SBR N 8.746 45.315 37.132 
157 SBR CA 9.857 45.747 37.932 
157 SBR CB 9.592 47.150 38.402 
157 SBR CC 8.442 47.158 39.213 
157 SBR C 10.085 44.828 39.123 

157 SBR O 10.623 45.251 40.147 

158 SBR N 9.695 43.568 39.049 
158 SBR CA 10.126 42.600 40.061 
158 SBR CB 9.046 42.518 41.150 
158 SBR OG 7.823 41.997 40.640 
158 SBR C 10.335 41.293 39.275 

158 SBR O 9.682 41.091 38.225 

159 XLS H 11.265 40.413 39.718 
159 ILB CA 11.600 39.245 38.894 
159 ILB CB 13.164 39.024 38.847 
159 ILB C62 13.801 40.300 38.272 
159 ILB CC1 13.729 38.612 40.201 
159 ILB CD 15.208 38.246 40.013 
159 ILB C 10.906 37.978 39.381 

159 ILB O 10.454 37.888 40.528 

160 SBR N 10.806 36,974 38.510 
160 SBR CA 10.114 35.754 38.841 
160 SSR CB 9.658 35.097 37.513 
160 SBR OG 16.700 34.817 36.581 

160 SSR C 10.947 34.777 39.691 
Jf? £5? ° "'"2 34.921 39.958 

161 TYR N 10.26S 33.738 40.148 
161 TYR CA 10.867 32.645 40.876 
JfJ ~ R CB 9 ' 887 32-231 41.988 
161 TYR CC 9.698 33.315 43.030 
161 TYR CD1 10.614 33.397 44.072 
161 TYR CS1 10.459 34.368 4S.057 
161 TYR CD2 8.619 34.189 42.939 
161 TYR CB2 8.459 35.175 43.906 
161 TYR C£ 9.384 35.241 44.953 

Jf 1 08 9 * 270 36.241 «5-896 

161 TYR C 11.101 31.499 39.865 

161 TYR O 10.257 31.307 38.975 

162 PRO N 12.153 30.681 39.954 
162 PRO CD 12.388 29.536 39.042 
162 PRO CA 13.162 30.687 41.003 
162 PRO CB 13.715 29.232 40.966 

11° 00 ".726 28.915 39.470 

162 PRO C 14.243 31.756 40.879 

162 PRO O 15.044 31.845 41.789 

163 ALA R 14.352 32.580 39.814 
163 ALA CA 15.393 33.575 39.716 
163 ALA CB 1S.16S 34.416 38.441 
163 ALA C 15.538 34.529 40.935 

163 ALA O 16.640 34.874 41.399 
16 , .V*C « 14.417 34.878 41.559 

164 ARC CA 14.385 35.740 42.745 



164 AR0 CB 12.939 36.127 43.071 

JfJ ^ 00 12 ' 741 37 ' 08 « ««.«7 

164 ARC CD 13.377 38.408 43,906 

164 ARC KB 13.251 39.367 44.988 

164 ARC CZ 14.206 39.530 45.901 

164 ARC KH1 14.020 40.475 46.838 

}«« ™ NH2 i5-289 38.737 45.965 

164 ARC C 15.032 35.123 43.973 

164 ARC 0 15.559 35.875 44.807 

165 TYR N 15.147 33.808 44.046 
165 TYR CA 15.787 33.157 45.176 
16S TYR CB 15.503 31.609 45.150 
165 TYR CC 14.046 31.230 45.501 
165 TYR CD1 13.399 31.807 46.600 
165 TYR CB1 12.084 31.484 46.885 
165 TYR CD2 13.379 30.328 44.696 
165 TYR CB2 12.067 30.003 44.992 
165 TYR Ct 11.444 30.587 46.078 
16S TYR OH 10.133 30.227 46.357 
165 TYR C 17.293 33.408 45.179 

165 TYR 0 17.996 33.477 44.141 

166 ALA N 17.829 33.600 46.368 
166 ALA CA 19.222 33.986 46.S44 
166 ALA CB 19.SS2 34.070 48.042 
166 ALA C 20.231 33.070 45.878 

166 ALA O 21.192 33.553 45.278 

167 ASN N 19.920 31.767 45.871 
167 ASN CA 20.860 30.806 45.280 
167 ASN CB 20.778 29.446 46.048 
167 ASN CC 21.566 29.545 47.374 
167 ASN ODl 22.592 30.238 47.502 
167 ASN ND2 21.130 28.931 48.461 
167 ASN C 20.712 30.572 43.776 

167 ASN O 21.411 29.727 43.205 

168 ALA N 19.760 31.248 43.121 

III ^ 19 ' 673 <l-«83 

168 ALA CB 18.206 31.007 41.284 

168 ALA C 20.259 32.481 41.121 

168 ALA O 19.961 33.600 41.595 

169 KBT N 21.005 32.366 40.015 
169 MET CA 21.563 33.497 39.321 

?!! £1 CB 22 ' 8S4 33.069 38.636 

169 MET CG 23.476 34.273 37.972 

169 MET SD 25.057 33.B51 37.212 

J" «* CB 25.641 35.532 37.199 

169 MET C 20.493 33.939 38.305 

169 MET O 19.998 33.150 37.484 

170 ALA N 20.047 35.196 38.436 

?™ ^ 18 ' 956 35 ' 777 37.681 

"2 *" CB 18.208 36.758 38.591 

170 ALA C 19.430 36.504 36.432 

170 ALA O 20.278 37.405 36.596 

171 VAL H 18.927 36.1S8 35.241 

' ^ °* 19 ' 332 36 ' 739 33.966 

171 VAL CB 19.862 35.590 33.075 

171 VAL CC1 20.380 36.267 31.766 

171 VAL CG2 20.946 34.749 33.786 

171 VAL C 18.192 37.445 33.235 

171 VAL O 17.14S 36.82i 11 979 

172 CLY N 18.474 38.712 32.887 
172 CLY CA 17.S94 39.568 32.123 
172 CLY C 18.038 39.553 30.640 

172 CLY O 19.106 39.023 30 302 

,11 ^ * 17 ' 231 40 ' 18 « 29-781 

173 ALA CA 17.461 40.220 28.347 

tnX Ctf CB 16 -2'8 39.626 27.617 
173 ALA C 17.667 41.631 27.812 
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173 ALA O 16.987 42.599 28.169 

174 THR N 18.639 41.714 26.910 
174 THR CA 18.993 42.944 26.176 
174 THR C8 20.504 43.281 26.349 
174 THR 001 21.329 42.130 26.074 
174 THR 002 20.733 43.727 27.767 
174 THR C 18.715 42.787 24.688 

174 THR O 18.622 41.662 24.148 

175 ASP N 18.674 43.934 24.029 
175 ASP CA 18.518 43.907 22.586 
175 ASP CB 17.388 44.840 22.148 
175 ASP CC 17.584 46.353 22.386 
175 ASP OD1 18.675 46.644 22.682 
175 ASP OD2 16.579 47.047 22.291 
175 ASP C 19.794 44.256 21.834 

175 ASP O 20.844 44.480 22.440 

176 GLN N 19.724 44.498 20.516 
176 GLN CA 20.938 44.742 19.737 
176 GLN CB 20.702 44.722 18.237 
176 GLN CG 20.123 43.400 17.797 
176 GLN CD 18.592 43.272 17.887 
176 GLN OE1 17.837 44.022 18.543 
176 GLN NB2 18.083 42.254 17.196 
176 GLN C 21.534 46.064 20.056 

176 GLN O 22.690 46.302 19.783 

177 ASN N 20.836 46.989 20.719 
177 ASN CA 21.362 48.288 21.098 
177 ASN CB 20.321 49.300 20.975 
177 ASN CG 19.832 49.550 19.587 
177 ASN OD1 20.577 49.605 18.631 
177 ASN ND2 18.526 49.678 19.484 
177 ASN C 21.895 48.299 22.521 

177 ASN O 22.380 49.322 23.026 

178 ASN N 21.875 47.139 23.202 
178 ASN CA 22.256 47.033 24.623 
178 ASN CB 23.735 47.479 24.896 
178 ASN CG 24.734 46.515 24.314 
178 ASN OD1 24.433 45.324 24.210 
178 ASN ND2 25.920 46.928 23.917 
178 ASN C 21.345 47.835 25.547 

178 ASN O 21.747 48.392 26.576 

179 ASN N 20.081 47.806 25.174 
179 ASN CA 19.000 48.319 26.009 
179 ASN CB 18.044 49.165 25.243 
179 ASN CG 18.566 50.593 25.088 
179 ASN 001 19.289 51.155 25.949 
179 ASN ND2 18.250 51.161 23.925 
179 ASN C 18.230 47.101 26.490 

179 ASN O 18.246 46.016 25.672 

180 ARG N 17.579 47.276 27.645 
180 ARG CA 16.734 46.241 28.230 
180 ARG CB 16.050 46.746 29.525 
180 ARG CG 15.269 45.653 30.233 
180 ARG CD 14.562 46.201 31.492 
180 ARG NE 13.537 47.146 31.076 
180 ARG CZ 12.271 46.850 30.720 
180 ARG NH1 11.476 47.846 30.339 
160 ARG NH2 11.709 45.650 30.752 
180 ARG C 15.639 45.909 27.213 

180 ARG O 14.991 46.855 26.715 

181 ALA N 15.377 44.644 26.848 
181 ALA CA 14.225 44.338 26.002 
181 ALA CB 14.266 42.683 25.663 
181 ALA C 12.942 44.677 26.771 

181 ALA O 12.873 44.495 28.009 
18* SH* N 11.894 45.172 26.133 

182 SER CA 10.757 45.650 26.927 



182 SIR CB 9.735 46.338 25.967 
182 SER 00 9.316 45.472 24.927 
162 SER C 10.061 44.602 27.814 

182 8ER O 9.557 44.957 26.679 

183 PHE n 10.056 43.321 27.403 
183 PHE CA 9.501 42.231 28.191 
183 PHE CB 9.261 40.999 27.282 
183 PHE CG 10.501 40.486 26.518 
183 PHE CD1 11.508 39.734 27.142 
183 PHE CD2 10.653 40.840 25.173 
183 PHE CE1 12.617 39.325 26.421 
183 PHE CE2 11.762 40.415 24.478 
183 PHE CZ 12.774 39.665 25.095 
183 PHE C 10.359 41.795 29.380 

183 PHE 0 9.889 41.025 30.246 

184 SER H 11.615 42.247 29.427 
184 SER CA 12.551 41.670 30.410 
164 SER CB 13.998 42.030 30.045 
184 SER OG 14.926 41.420 30.947 
184 SER C 12.261 42.125 31.843 

184 SER O 12.450 43.331 32.137 

185 GLN N 11.911 41.197 32.727 
185 GLN CA 11.652 41.622 34.089 
185 GLN CB 11.034 40.489 34.904 
185 GLN CG 9.595 40.335 34.482 
185 GLN CD 8.912 39.174 35.165 
185 GLN OE1 8.817 39.005 36.377 
185 GLN NE2 8.397 38.320 34.331 
185 GLN C 12.960 42.075. 34.773 

185 GLN O 14.066 41.606 34.458 

186 TYR N 12.871 43.046 35.676 
186 TYR CA 14.048 43.618 36.349 
186 TYR CB 14.468 44.924 35.634 
186 TYR CG 13.385 45.992 35.576 
186 TYR CD1 12.362 45.872 34.635 
186 TYR CE1 11.347 46.805 34.553 
186 TYR CD2 13.385 47.049 36.468 
186 TYR CE2 12.386 47.988 36.396 
186 TYR CZ 11.376 47.855 35.450 
186 TYR OB 10.418 46.846 35.328 
186 TYR C 13.735 43.925 37.819 

186 TYR O 12.616 43.620 38.262 

187 CLY N 14.620 44.547 38.575 
187 GLY CA 14.330 44.849 39.958 
187 CLY C 15.232 44.062 40.892 

187 GLY O 16.318 43.548 40.541 

188 ALA N 14.782 43.915 42.140 
188 ALA CA 15.616 43.340 43.172 
188 ALA CB 14.891 43.435 44.515 
188 ALA C 15.973 41.884 42.894 

188 ALA O 15.134 41.065 42.549 

189 GLY N 17.263 41.594 42.986 
189 CLY CA 17.747 40.223 42.778 
189 GLY C 18.299 39.938 41.358 

189 CLY O 18.911 38.873 41.139 

190 LEU R 18.128 40.857 40.397 
190 LEU CA 18.646 40.601 39.064 
190 LEU CB 18.023 41.621 38.094 
190 LEU CC 18.302 41.4S4 36.607 
190 LEU CD1 17.688 40.163 36.140 
190 LEU CD2 17.844 42.716 35.848 
190 LEU C 20.169 40.671 39.079 

190 LEU O 20.776 41.624 39 589 

191 ASP N 20.847 39.677 38.505 
191 ASP CA 22.285 39.597 38.558 
1*' .W C3 22.732 38.163 tl 777 
191 ASP CG 22.428 37.668 40.182 
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191 ASP OD1 22.903 38.250 41.148 

191 ASP 002 21. 60S 36.717 40.309 

191 ASP C 23.037 40.095 37.355 

191 ASP O 24.122 40.674 37.449 

192 ILK N 22.464 39.842 36.171 
192 ILK CA 23.192 40.070 34.908 
192 ILE CB 24.291 38.919 34.852 
192 ILE CG2 23.628 37.619 34.325 
192 ILK CGI 25.513 39.314 34.012 
192 ILE CD 26.686 38.323 34.226 
192 ILE C 22.176 40.008 33.774 

192 ILE O 21.020 39.545 33.967 

193 VAL N 22.644 40.477 32.608 
193 VAL CA 21.847 40.379 31.392 
193 VAL CB 21.246 41.745 30.945 
193 VAL CGI 20.189 42.187 31.967 
193 VAL CG2 22.326 42.772 30.755 



193 


VAL 


C 


22.653 


39.820 


30.203 


193 


VAL 


o 


23.885 


39.799 


30.174 


194 


ALA 


N 


21.891 


39.376 


29.204 


194 


ALA 


CA 


22.453 


38.610 


28.000 


194 


ALA 


CB 


22.770 


37.303 


28.2S3 


194 


ALA 


C 


21.446 


38.965 


26.837 


194 


ALA 


O 


20.264 


39.273 


27.044 


195 


PRO 


N 


21.672 


38.794 


25.576 


195 


PRO 


CD 


23.294 


38.583 


25.188 


195 


PRO 


CA 


21.018 


38.880 


24.377 


195 


PRO 


CB 


21.699 


38.465 


23.180 


195 


PRO 


CG 


23.321 


38.854 


23.643 


195 


PRO 


C 


19.802 


38.002 


24.479 


195 


PRO 


0 


19.931 


36.816 


24.761 


196 


GLY 


N 


18.648 


38.574 


24.192 


196 


GLY 


CA 


17.403 


37.833 


24.257 


196 


GLY 


C 


16.401 


38.217 


23.175 


196 


GLY 


o 


15.214 


37.925 


23.303 


197 


VAL 


N 


16.829 


38.890 


22.068 


197 


VAL 


CA 


15.888 


39.285 


21.035 


197 


VAL 


CB 


15.690 


40.877 


21.010 


197 


VAL 


CGI 


14.919 


41.323 


19.738 


197 


VAL 


CG2 


15.038 


41.327 


22.327 


197 


VAL 


C 


16.483 


38.765 


19.727 


197 


VAL 


o 


17.672 


38.697 


19.432 


198 


ASN 


N 


15.627 


38.173 


18.937 


198 


ASN 


CA 


15.957 


37.626 


17.630 


198 


ASN 


CB 


16.220 


38.703 


16.520 


198 


ASN 


CG 


15.814 


38.095 


15.160 



198 ASN OD1 15.010 37.149 15.093 

198 ASN ND2 16.255 38.621 14.013 

198 ASN C 17.160 36.718 17.695 

198 ASN O 18.147 36.910 16.978 

199 VAL N 17.039 35.746 18.605 
199 VAL CA 18.096 34.791 18.849 
199 VAL CB 18.135 34.490 20.377 
199 VAL CGI 19.303 33.623 20.702 
199 VAL CG2 18.493 35.732 21.205 
199 VAL C 17.872 33.522 18.017 

199 VAL O 16.912 32.776 18.194 

200 GLN N 18.706 33.324 17.005 
200 GLN CA 18.771 32.144 16.138 
200 GLN CB 19.584 32.515 14.908 
200 GLN CG 19.819 31.348 13.964 
200 GLN CD 20.240 31.677 12.544 
200 GLN OKI 21.324 32.176 12.338 
200 GLN NB2 19.502 31.494 11.476 
200 GLN C 19.433 30.946 16.796 

200 GLN O 20.567 31.114 17.277 

201 SER K 18.810 29.760 16.799 



201 SER CA 19.476 26.564 17.356 
201 SER CB 19.283 28.526 18.891 
201 SER OG 20.089 27.563 19.530 
201 SER C 18.875 27.346 16.701 

201 SER O 18.062 27.448 15.779 

202 THR N 19.318 26.189 17.171 
202 THR CA 16.879 24.880 16.747 
2Q2 THR CB 19.769 23.760 17.461 
202 THR 0G1 19.869 24.043 18.866 
202 THR CG2 21.204 23.712 16.888 
202 THR C 17.412 24.706 17.082 

202 THR O 16.901 25.159 18.115 

203 TYR N 16.712 23.986 16.227 
203 TYR CA 15.286 23.728 16.398 
203 TYR CB 14.508 24.620 IS. 615 
203 TYR CG 13.165 25.140 16.239 
203 TYR CD1 13.129 25.884 17.421 
203 TYR CE1 11.918 26.223 17.992 
203 TYR CD 2 11.996 24.708 15.619 
203 TYR CE2 10.770 25.044 16.193 
203 TYR CZ 10.757 25.798 17.369 
203 TYR OH 9.560 26.166 17.949 
203 TYR C 14.941 22.322 15.901 

203 TYR 0 15.658 21.779 15.040 

204 PRO N 13.905 21.662 16.450 
204 PRO CD 13.057 22.111 17.596 
204 PRO CA 13.468 20.319 15.980 
204 PRO CB 12.178 20.026 16.797 
204 PRO CG 12.414 20.819 18.098 
204 PRO C 13.249 20.306 14.463 

204 PRO O 12.965 21.337 13.825 

205 GLY N 13.473 19.119 13.895 
205 GLY CA 13.358 18.927 12.435 
205 GLY C 14.643 19.310 11.724 

205 GLY O 14.632 19.630 10.535 

206 SKR N 15.770 19.252 12.442 
206 SER CA 17.067 19.586 11.924 
206 SSR CB 17.523 18.417 11.036 
206 SER OG 17.461 17.216 11.797 
206 SER C 17.098 20.931 11.175 

206 SER O 17.591 21.045 10.047 

207 THR N 16.566 21.968 11.842 
207 THR CA 16.518 23.294 11.258 
207 THR CB 15.070 23.518 10.667 
207 THR OGl 15.190 24.695 9.866 
207 THR CC2 13.924 23.606 11.700 
207 THR C 16.928 24.275 12.354 

207 THR O 17.600 23.908 13.342 

208 TYR N 16.632 25.546 12.113 
208 TYR CA 17.071 26.693 12.914 
208 TYR CB 18.333 27.321 12.307 
208 TYR CG 19.364 26.245 12.061 
208 TYR CD1 19.428 25.565 10.842 
208 TYR CE1 20.274 24.513 10.648 
208 TYR CD 2 20.152 25.869 13.110 
208 TYR CS2 20.978 24.825 12.917 
208 TYR Cr 21.039 24.151 11.713 
208 TYR OH 21.935 23.103 11.601 
208 TYR C 15.936 27.689 12.911 

208 TYR O 15.224 27.863 11.906 

209 ALA N 15.728 28.316 14.076 
209 ALA CA 14.653 29.234 14.266 
209 ALA CB 13.489 28.384 14.707 
209 ALA C 15.041 30.312 15.266 

209 ALA O 16.021 30.178 16.019 

210 SER N 14.378 31.450 15.089 
210 $ER CA 14.567 32.642 15.?!4 
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210 SER CB 14.614 33.893 IS. 065 
210 SIR 00 15.766 33.756 14.342 
210 SIR C 13.456 32.819 16.920 

210 SKR O 12.255 32.689 16.610 

211 LEU N 13.895 33.079 16.152 
211 LKU CA 12.990 33.304 19.244 
211 LEU CB 12.963 32.069 20.118 
211 LEU CG 12.368 30.846 19.535 
211 LEU COl 12.346 29.857 20.657 
211 LEU CD 2 10.940 31.056 19.033 
211 LEU C 13.372 34.503 20.110 

211 LEU O 14.547 34.927 20.110 

212 ASN N 12.439 35.024 20.912 
212 ASN CA 12.734 36.191 21.741 
212 ASN CB 11.883 37.403 21.413 
212 ASN CG 11.961 37.853 19.972 
212 ASN OD1 12.979 38.246 19.415 
212 ASN ND2 10.841 37.797 19.283 
212 ASN C 12.354 35.787 23.156 

212 ASN O 11.336 35.119 23.350 

213 GLY N 13.070 36.197 24.217 
213 GLY CA 12.648 35.928 25.599 
213 GLY C 13.834 35.974 26.520 

213 GLY O 14.990 35.843 26.099 

214 THR N 13.583 36.141 27.832 
214 THR CA 14.656 36.016 28.629 
214 THR CB 14.204 36.523 30.242 
214 THR OG1 12.998 35.812 30.594 
214 THR CG2 14.014 38.055 30.271 
214 THR C 15.128 34.527 28.894 

214 THR O 16.253 34.214 29.302 

215 SER N 14.304 33.607 28.380 
215 SER CA 14.663 32.187 . 28.217 
215 SER CB 13.425 31.449 27.696 
21S SER OG 12.324 31.235 28.564 
215 SER C 15.860 31.981 27.237 

215 SER O 16.588 30.993 27.305 

216 MET N 16.039 32.907 26.272 
216 MET CA 17.165 32.901 25.324 
216 MET CB 16.776 33.575 24.055 
216 KET CG 15.843 32.791 23.121 
216 KET SD 14.133 32.519 23.660 
216 KET CE 14.311 30.783 23.925 
216 KET C 18*372 33.638 25.865 

216 MET O 19.506 33.386 25.460 

217 ALA N 18.136 34.558 26.845 
217 ALA CA 19.249 35.257 27.465 
217 ALA CB 18.739 36.485 28.240 
217 ALA C 19.991 34.343 28.432 

217 ALA O 21.223 34.249 28.386 

218 THR N 19.211 33.574 29.199 
218 THR CA 19.756 32.657 30.231 
218 THR CB 18.587 31.860 30.888 
218 THR OG1 17.719 32.837 31.429 
218 THR CG2 19.040 30.887 31.979 
218 THR C 20.824 31.704 29.700 

218 THR O 21.912 31.648 30.275 

219 PRO II 20.683 31.008 28.586 
219 PRO CD 19.479 30.643 27.793 
219 PRO CA 21.708 30.099 28.089 
219 PRO CB 21.074 29.364 26.909 
219 PRO CG 19.943 30.268 26.471 
219 PRO C 23.027 30.765 27.704 

219 PRO O 24.060 30.108 27.745 

220 HIS K 22.994 32.051 27.345 
220 HIS CA 24.239 32.770 27.094 
220 HIS CB 23.997 34.219 26.600 



220 HIS CG 23.307 34.345 25.237 
220 HIS CD2 24.010 34.501 24.048 
220 HIS ND1 21.999 34.359 24.936 
220 HIS CE1 21.649 34.518 23.642 
220 HIS NE2 23.064 34.607 23.115 
220 HIS C 25.048 32.824 28.410 

220 HIS O 26.276 32.626 28.383 

221 VAL N 24.370 32.933 29.566 
221 VAL CA 25.084 32. $89 30.830 
221 VAL CB 24.180 33.727 31.843 
221 VAL CGI 24.746 33.674 33.267 
221 VAL CG2 24.119 35.194 31.366 
221 VAL C 25.477 31.606 31.299 

221 VAL O 26.612 31.424 31.734 

222 ALA N 24.617 30.614 31.120 
222 ALA CA 24.981 29.223 31.421 
222 ALA CB 23.871 28.283 31.032 
222 ALA C 26.229 28.786 30.670 

222 ALA O 27.129 26.121 31.204 

223 GLY N 26.258 29.180 29.388 
223 GLY CA 27.463 28.928 28.649 
223 GLY C 28.715 29.661 29.070 

223 GLY O 29.806 29.064 29.098 

224 ALA N 28.557 30.955 29.357 
224 ALA CA 29.706 31.677 29.842 
224 ALA CB 29.313 33.106 30.058 
224 ALA C 30.261 31.051 31.147 

224 ALA O 31.463 30.894 31.314 

225 ALA N 29.387 30.580 32.016 
225 ALA CA 29.771 29.836 33.221 
225 ALA CB 28.560 29.321 34.020 
225 ALA C 30.593 26.603 32.864 

225 ALA O 31.630 28.374 33.487 

226 ALA N 30.248 27.816 31.843 
226 ALA CA 31.033 26.664 31.490 
226 ALA CB 30.292 25.958 30.380 
226 ALA C 32.446 27.076 31.054 

226 ALA O 33.421 26.381 31.370 

227 LEU N 32.587 28.209 30.328 
227 LEU CA 33.888 26.734 29.901 
227 LEU CB 33.691 29.983 28.955 
227 LEU CG 32.901 29.762 27.666 
227 LEU CD1 32.816 31.015 26.813 
227 LEU CD2 33.598 28.704 26.902 
227 LEU C 34.782 29.060 31.088 

227 LEU O 35.954 28.623 31.131 

228 VAL N 34.176 29.711 32.105 
228 VAL CA 34.951 30.076 33.286 
228 VAL CB 34.114 31.094 34.168 
228 VAL CGI 34.822 31.451 35.502 
228 VAL CG2 33.950 32.402 33.362 
228 VAL C 35.340 28.614 34.074 

228 VAL O 36.468 28.777 34.573 

229 LYS N 34.502 27.781 34.115 
229 LYS CA 34.817 26.566 34.865 
229 LYS CB 33.575 25.679 34.978 
229 LYS OG 33.758 24.324 35.713 
229 LYS CD 34.180 24.479 37.170 
229 LYS CE 34.230 23.097 37.844 
229 LYS HZ 34.394 23.211 39.298 
229 LYS C 35.919 25.792 34.170 

229 LYS O 36.804 25.233 34.841 

230 GUI II 35.915 25.679 32.835 
230 GLN CA 37.001 24.957 32.188 
230 GLN CB 36.692 24,852 30.683 
230 GUI CG 37.819 24.181 29.916 
230 GUI CD 37.806 24.343 28.410 
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230 OLN OKI 36.941 24.907 27.731 

230 OLN NE2 36.866 23.779 27.664 

230 CLN C 36.324 2S.710 32.453 

230 GLN O 39.365 25.106 32.722 

231 LYS N 36.320 27.043 32.369 
231 LYS CA 39.482 27.877 32.678 
231 LYS CB 39.085 29.347 32.389 
231 LYS CG 40.041 30.518 32.637 
231 LYS CD 41.380 30.478 31.945 
231 LYS CB 42.078 31.872 31.997 
231 LYS NZ 42.377 32.352 33.343 
231 LYS C 39.970 27.715 34.142 

231 LYS O 41.173 27.658 34.409 

232 ASN N 39.023 27.635 35.097 
232 ASN CA 39.292 27.588 36.520 
232 ASN CB 38.801 28.646 37.227 
232 ASN CG 39.339 30.115 36.617 
232 ASN OD1 40.486 30.464 36.859 
232 ASN ND2 38.537 30.834 3S.845 
232 ASN C 38.59S 26.402 37.158 

232 ASN O 37.63S 26.555 37.907 

233 PRO N 39.057 25.173 36.945 
233 PRO CD 40.245 24.847 36.150 
233 PRO CA 38.320 23.978 37.376 
233 PRO CB 39.053 22.819 36.729 
233 PRO CG 40.441 23.367 36.519 
233 PRO C 38.155 23.820 38.863 

233 PRO O 37.266 23.094 39.274 

234 SER N 38.962 24.489 39.675 
234 SER CA 38.725 24.374 41.124 
234 SER CB 40.005 24.643 41.961 
234 SER OG 40.378 26.007 41.847 
234 SER C 37.635 25.309 41.680 

234 SER O 37.203 25.124 42.824 

235 TRP N 37.151 26.270 40.878 
235 TRP CA 36.213 27.246 41.393 
235 TRP CB 36.022 28.366 40.435 
235 TRP CG 37.165 29.323 40.391 
235 TRP CD2 37.103 30.539 39.761 
235 .TRP CE2 38.384 31.011 39.929 
235 TRP CE3 36.167 31.261 39.083 
235 TRP CD1 38.405 29.059 40.930 
235 TRP NE1 39.136 30.109 40.623 
235 TRP CZ2 38.726 32.237 39.404 
235 TRP CZ3 36.502 32.474 38.559 
235 TRP CH2 37.775 32.956 38.720 
235 TRP C 34.862 26.643 41.637 

235 TRP O 34.427 25.726 40.941 

236 SER N 34.206 27.137 42.669 
236 SER CA 32.884 26.712 43.011 
236 SER CB 32.771 26.915 44.541 
236 SER OG 32.691 28.301 44.902 
236 SER C 31.891 27.549 42.200 

236 SER O 32.195 28.606 41.637 

237 ASN N 30.645 27.084 42.278 
237 ASN CA 29.495 27.743 41.705 
237 ASN CB 28.255 26.923 42.112 
237 ASN CG 27.966 26.679 43.605 
237 ASN OD1 28.706 27.112 44.495 
237 ASN ND2 26.851 26.017 43.928 
237 ASN C 29.388 29.219 42.117 

237 ASN O 29.255 30.109 41.266 

238 VAL N 29.592 29.555 43.414 
238 VAL CA 29.576 30.945 43.876 
238 VAL CB 29.553 30.919 45.442 
k3S v*Ai> wol 2S.767 32.294 46.097 
238 VAL CG2 28.199 30.344 45.805 



238 VAL C 30.741 31,770 43.322 
236 VAL O 30.584 32.955 42.971 

239 GLN N 31.903 31.146 43.161 
239 GLN CA 33.058 31.865 42.654 
239 GLN CB 34.348 31.007 42.712 
239 GLN CG 34.787 30.771 44.165 
239 GLN CD 36.001 29.647 44.293 
239 GLN OE1 35.946 28.629 44.354 
239 GLN NE2 37.174 30.441 44.326 
239 OLN C 32.611 32.264 41.203 

239 GLN O 33.124 33.398 40.784 

240 ILK N 32.261 31.291 40.463 
240 XLS CA 31.950 31.500 39.047 
240 XLE CB 31.410 30.186 38.368 
240 XLS CG2 31.025 30.399 36.876 
240 XLE CGI 32.503 29.161 38.463 
240 XLE CD 32.041 27.775 37.973 
240 XLE C 30.902 32.584 38.896 

240 XLE O 31.087 33.511 38.104 

241 ARG N 29.819 32.464 39.667 
241 ARG CA 28.769 33.495 39.638 
241 ARG CB 27.701 33.092 40.6S5 
241 ARG CG 26.634 34.192 40.895 
241 ARG CD 25.462 33.692 41.771 
241 ARG NE 24.364 34.639 41.945 
241 ARG CZ 23.323 34.340 42.749 
241 ARG NH1 22.325 35.215 42.920 
241 ARG NH2 23.252 33.149 43.371 
241 ARG C 29.313 34.923 39.937 

241 ARG O 29.037 35.874 39.200 

242 ASN N 30.153 35.073 40.959 
242 ASN CA 30.649 36.413 41.277 
242 ASN CB 31.391 36.455 42.609 
242 ASN CG 30.386 36.371 43.746 
242 ASN OD1 29.177 36.652 43.659 
242 ASN ND2 30.877 35.681 44.877 
242 ASN C 31.591 36.931 40.225 

242 ASN O 31.631 38.152 39.938 

243 KXS N 32.330 36.012 39.584 
243 BXS CA 33.284 36.451 38.593 
243 HXS CB 34.183 35.327 38.178 
243 BXS CG 35.409 35.790 37.413 
243 HXS CD 2 36.367 36.638 37.902 
243 BXS ND1 35.770 35.447 36.181 
243 BXS CS1 36.908 36.044 35.892 
243 BXS NS2 37.250 36.757 36.945 
243 HXS C 32.559 36.966 37.370 

243 HXS O 32.988 37.984 36.820 

244 LEU N 31.473 36.265 36.963 
244 LEU CA 30.709 36.649 35.801 
244 LEU CB 29.576 35.636 35.501 
244 LEU CG 29.971 34.234 34.958 
244 LEU CD1 28.719 33.367 34.841 
244 LEU CD 2 30.649 34.360 33.602 
244 LEU C 30.147 38.007 36.104 

244 LEU O 30.189 38.853 35.217 

245 LYS N 29.690 38.289 37.328 
245 LYS CA 29.178 39.632 37.654 
245 LYS CB 28.452 39.593 38.993 
245 LYS CG 27.193 38.687 38.928 
245 LYS CD 26.536 38.412 40.289 
245 LYS CE 25.811 39.677 40.573 
245 LYS RZ 25.221 39.607 41.886 
245 LY8 C 30.300 40.665 37.714 

245 LYS O 30.125 41.805 37.257 

246 AC:: K 21.4S2 .'.0.279 22.12$ 
246 ASN CA 32.579 41.194 38.352 
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251 LEU CD1 23.663 46.020 29.043 
251 LEU CD 2 25.595 47.430 26.481 
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256 LEU CO 16.565 49*634 34.134 
256 LEU CD1 16.919 50.462 32.887 
256 LEU C02 15.095 49.321 34.182 
256 LEU C ,18.284 46.162 134.546 

256 LEU 0 17.798 45.300 33.803 

257 TYR N 19*566 46*246 34.888 
257 TYR CA 20.590 45.390 34.268 
257 TYR CB 21.606 46*225 33*447 
257 TYR CG 20*957 47.1Q6 32*389 
257 TYR C01 20.349 46.459 31.337 
257 TYR CE1 19*733 47.179 30.384 
257 TYR C02 20.951 48.503 32.449 
257 TYR CS2 20*330 49.219 31.446 
257 TYR CZ 19.731 48.536 30.426 
257 TYR OH 19.142 49.131 29.335 
257 TYR C 21.424 44.557 35.226 

257 TYR O 22.226 43.739 34.776 

258 GLY N 21.305 44.756 36.542 
258 GLY CA 22.222 44.130 37.496 
258 GLY C 23.630 44.552 37.201 

258 GLY O 23.896 45.710 36.877 

259 SER N 24.511 43.586 37.273 
259 SER CA 25.897 43.856 36.955 
259 SER CB 26.747 42*633 37.239 
259 SER OG 26.779 42.518 38.660 
259 SER C 26.153 44.278 35.527 

259 SER O 27.225 44.856 35.285 

260 GLY N 25.225 44.013 34.600 
260 GLY CA 25.413 44.431 33.222 
260 GLY C 25.476 43.210 32.331 

260 GLY O 24.999 42.106 32.672 

261 LEU N 26.036 43.461 31.151 
261 LEU CA 26.105 42.461 30.087 
261 LEU CB 26.274 43.195 28.721 
261 LEU CG 26.349 42.381 27.424 
261 LEU C01 25.064 41.598 27.191 
261 LEU CD2 26.675 43.372 26.282 
261 LEU C 27.234 41.470 30.309 

261 LEU O 28.410 41.842 30.426 

262 VAL N 26.651 40.192 30.263 
262 VAL CA 27.872 39.161 30.432 
262 VAL CB 27.227 37.754 30.407 
262 VAL CGI 26.633 37.448 29.036 
262 VAL CG2 28.305 36.734 30.824 
262 VAL C 28.935 39.300 29.331 

262 VAL O 28.661 39.699 28.193 

263 ASN N 30.181 39.070 29.700 
263 ASN CA 31.271 39.216 28.755 
263 ASN CB 31.866 40.599 28.993 
263 ASN CG 33.072 40.880 28.136 
263 ASN OD1 33.666 40*009 27.502 
263 ASN ND2 33.498 42.124 28.143 
263 ASN C 32.250 38.068 28.945 

263 ASN O 33.119 37.994 29.826 

264 ALA N 32.136 37.126 28*030 
264 ALA CA 32*947 35.931 28*088 
264 ALA CB 32*528 34.857 27.080 
264 ALA C 34.404 36.250 27.801 

264 ALA O 35.2S9 35*517 28.331 

265 GLU N 34.752 37.304 27.054 
265 GLU CA 36.169 37.625 26.884 
265 GLU CB 36.346 38*768 25.842 
265 GLU CG 37.790 39.302 25.597 
265 GLU CO 38.470 40.138 26.723 
26S GLU OEl 39.623 39.854 27.100 
265 GLD OE2 37.835 41.060 27.255 
265 GLO C 36.745 38.057 28.227 
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265 CLU O 37.766 37,524 28.689 
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25.605 
32.346 
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14.055 
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26.065 
11.945 
19.643 
36.430 
13.501 
16.785 
28.911 
29.797 
6.968 
21.830 
18.231 
17.725 
34.461 
19.764 
13.211 
10.729 
22.023 
26.324 
30.661 
8.433 
32.021 
32.606 
27.916 
17.445 
16.527 
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17.440 
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